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direcdy from any ulllSSSS - ~^P^ app.ica.ion. 
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However it may be difficult to discriminate between closely related species 
when using primers derived from the 16S rRNA. In some instances 16S rRNA. 
sequence identity may not be sufficient to guarantee species identity (Fox et aL, 
1992 Int J. Syst. Bacteriol. 42:166-170) and it has been shown that inter-operon 
sequence variation as well as strain to strain variation could undermine the 
application of 16S rRNA for identification purposes (Clayton et al., 1995, Int. J. Syst. 
Bacteriol. 45:595-599). 



STATEMENT OF THE INVENTION 



It is an object of the present invention to provide a specific, ubiquitous and 
sensitive method using probes and/or amplification primers for determining the 
presence and/or amount of nucleic acids: 

- from any bacterial, fungal or parasitical species 

in any sample suspected of containing said nucleic acids, and optionally, 

- from specific microbial species or genera selected from the group consisting 
of the species or genera listed in Table 4 

- from an antibioticjresistance gene-selected from the group consisting of the 
genes listed in Table 5, and optionally, 

- from a toxin gene selected from the group consisting of the genes listed in 
Table 6, 

wherein each of said nucleic acids or a variant or part thereof comprises a 
selected target region hybridizable with said probe or primers; 

said method comprising the steps of contacting said sample with said probes or 
primers and detecting the presence and/or amount of hybridized probes or amplified 
products as an indication of the presence and/or amount of said any microbial 
species, specific microbial species or genus or family or group and antibiotic 
resistance gene and/or toxin gene. 

In a specific embodiment, a similar method directed to each specific microbial 
species or genus or family or group detection and identification, antibiotic resistance 
genes detection, toxin genes detection, and universal bacterial detection, separately, is 
provided. 

In a more specific embodiment, the method makes use of DNA fragments from 
conserved genes (proprietary sequences and sequences obtained from public 
databases), selected for their capacity to sensitively, specifically and ubiquitously 
detect the targeted bacterial, fungal or parasitical nucleic acids. 

In a particularly preferred embodiment, oligonucleotides of at least 12 

8 
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nucleotides in length have been derived from the longer DNA fragments, and are 
used in the present method as probes or amplification primers. 

In another particularly preferred embodiment, oligonucleotides primers and 
probes of at least 12 nucleotides in length are designed for their specificity and 
5 ubiquity based upon analysis of our databases of tuf, atpD and recA sequences. These 
databases are generated using both proprietary and public sequence information. 
Altogether, these databases form a sequence repertory useful for the design of 
primers and probes for the detection and identification of bacterial, fungal and 
parasitical microorganisms. The repertory can also be subdivided into subrepertories 
10 for analysis leading to the design of primers and probes. 

The tuf, atpD and recA sequences databases as a product to assist the design of 
oligonucleotides primers and probes for the detection and identification of bacterial, 
fungal and parasitical microorganisms are also an object of this invention. 

The proprietary oligonucleotides (probes and primers) are also another object 

15 of the invention. 

Diagnostic kits comprising probes or amplification primers for the detection of 
a microbial species or genus or family or group selected from the following list 
consisting of Bordetella sp., Candida albicans, Candida dubliniensis, Candida sp., 
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium sp., Corynebacterium 

20 sp., Cryptosporidium parvum, Entamoeba sp., Enterobacteriaceae group, 
Enterococcus casseliflavus-flavescens-gallinarum group, Enterococcus faecalis, 
Enterococcus faecium, Enterococcus gallinarum, Enterococcus sp., Escherichia coli, 
Giardia sp., Haemophilus influenzae, Trypanosomatidae family, Leishmania sp., 
Mycobacteriaceae family, Neisseria gonorrhoeae, platelets contaminants group (see 

25 Table 14), Pseudomonads group, Staphylococcus aureus, Staphylococcus 
epidermidis. Staphylococcus haemolyticus, Staphylococcus hominis, Staphylococcus 
saprophyticus, Staphylococcus sp., Streptococcus agalaetiae, Streptococcus sp., 
Trypanosoma brucei, Trypanosoma cruzi, Trypanosoma sp., are also objects of the 
present invention. 

30 Diagnostic kits further comprising probes or amplification primers for the 

detection of an antibiotic resistance gene selected from the group listed in Table 5 are 
also objects of this invention. 

Diagnostic kits further comprising probes or amplification primers for the 
detection of a toxin gene selected from the group listed in Table 6 are also objects of 

35 this invention. 

Diagnostic kits further comprising probes or amplification primers for the 
detection of any bacterial, fungal or parasitical species, comprising or not comprising 
those for the detection of the specific microbial species or genus or family or group 
listed above, and further comprising or not comprising probes and primers for the 
40 antibiotic resistance genes listed above, and further comprising or not comprising 

9 
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nrobes and primers for the toxin genes listed above are also objects of this invention^ 
P in a preferred embodiment, such a kit allows for the separate or the 
simultaneous detection and identification of the above-listed ™^J"£»£ 
genus or family or group, antibiotic resistance genes, toxin genes and for the 

5 detection of any microorganism (bacteria, fungi or parasite). 

In the above methods and kits, amplification reactions may include but are ^no 
restricted to: a) polymerase chain reaction (PCR), b) ligase chain reaction (LCR), c) 
nucleic acid sequence-based amplification (NASBA), d) self-sustained sequence 
^cation (3SR), e) strand displacement amplification (SDA) 3J«*^ 

0 signal amplification (bDNA), g) transcription-mediated amplification (™A) h) 
cycling probe technology (CPT), i) nested PCR, j) multiplex PCR k) solid phase 
ampUfication (SPA), 1) nuclease dependent signal amplification (NDSA), m) rolling 

circle amplification technolog> ^T) of target genes 

In the above meaioas ana v*v*e%,au.* v , _ % . A „ 

is mav delude real-time or post-amplification technologies. These detection 
ZLZ&s can include, but are not limited to, fluorescence resonance energy 
T^sfer T^ETD-based methods such as adjacent hybridization to FRET probes 
Sding ^probe-probe and probe-primer methods), Taqman, 

Scomionf Other detection methods include target genes nucleic acids detection via 
20 m^nolog^al methods, solid phase hybridization methods on filters, chips or any 
oTr stud support, whether-the hybridization is monitored by fluorescence, 
^llun^escence, potentiometry, mass spectrometry, plasmon resource 
I™™ or colorimetry. Sequencing, including sequencing by dideoxy 
£2 o "equtcing b/hybridization, is another possible method to detect and 
25 identify the nucleic acids of target genes. 

In a preferred embodiment, a PCR protocol is used for nucleic actd 

^'""particularly preferred embodiment, a PCR protocol is provided, 
comprising, an initial denaturation step of 1-3 minutes a. 95 ,°C, j« * » 

30 ampUfication cycle including a denaturation step of one second at 95 C and an 
aTealing step of 30 seconds a. 45-65 "C, without any time allowed 
the elongation step. This PCR protocol has been standardized to be suitable for PCR 
reactions with most selected primer pairs, which greatly facilitates the testmg because 
each clinical sample can be tested with universal, species-specific, genus-speaSc, 

35 antibiotic resistance gene and toxin gene PCR primers under uniform cydtng 
conditions. Furthermore, various combinations of primer pairs may be used in 

m *SZb je c t of the present invention that ,„/ sequences, KpD sequences 
and recA sequences could serve as drug targets and these sequences and means to 
obtain them revealed in the present invention can assist the screening, design and 



40 

modeling of these drugs 
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It is also an object of the present invention that tuf sequences, atpD sequences 
and recA sequences could serve for vaccine purposes and these sequences and means 
to obtain them revealed in the present invention can assist the screening, design and 
modeling of these vaccines. 

5 

We aim at developing a rapid test or kit (universal) to discard rapidly all the 
samples which are negative for bacterial, fungal or parasitical cells. This test could be 
used alone or combined with more specific identification to detect and identify the 
above bacterial and/or fungal and/or parasitical species and genera and to determine 

10 rapidly the bacterial resistance to antibiotics and/or presence of bacterial toxins. 
Although the sequences from the selected antibiotic resistance genes are available 
from public databases and have been used to develop DNA-based tests for their 
detection, our approach is unique because it represents a major improvement over 
current diagnostic methods based on bacterial cultures. Using an amplification 

15 method for the simultaneous bacterial detection and identification and antibiotic 
resistance genes detection, there is no need for culturing the clinical sample prior to 
testing. Moreover, a modified PCR protocol has been developed to detect all target 
DNA sequences in approximately one hour under uniform amplification conditions. 
This procedure will save lives by optimizing treatment, will diminish antibiotic 

20 resistance because less antibiotics will be prescribed, will reduce the use of broad 
spectrum antibiotics which are expensive, decrease overall health care costs by 
preventing or shortening hospitalizations, and decrease the time and costs associated 
with clinical laboratory testing. 

The diagnostic kits, primers and probes mentioned above can be used to 

25 identify bacteria, fungi, parasites, antibiotic resistance genes and toxin genes on any 
type of sample, whether said diagnostic kits, primers and probes are used for in vitro 
or in situ applications. The said samples may include but are not limited to: any 
clinical sample, any environment sample, any microbial culture, any microbial 
colony, any tissue, and any cell line. 

30 It is also an object of the present invention that said diagnostic kits, primers 

and probes can be used alone or in conjunction with any other assay suitable to 
identify microorganisms, including but not limited to: any immunoassay, any 
enzymatic assay, any biochemical assay, any lysotypic assay, any serological assay, 
any differential culture medium, any enrichment culture medium, any selective 

35 culture medium, any specific assay medium, any identification culture medium, any 
enumeration cuture medium, any cellular stain, any culture on specific cell lines, and 
any infectivity assay on animals. 

In the methods and kits described herein below, the oligonucleotide probes and 
amplification primers have been derived from larger sequences (i.e. DNA fragments 

40 of at least 100 base pairs). All DNA fragments have been obtained either from 
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proprietary fragments or from public databases. DNA fragments selected from public 
databases are newly used in a method of detection according to the present invention, 
since they have been selected for their diagnostic potential. 

In another embodiment, the amino acid sequences translated from the repertory 
5 of tuf> atpD and recA sequences are also an object of the present invention. 

It is clear to the individual skilled in the art that other oligonucleotide 
sequences appropriate for (i) the universal bacterial detection, (ii) the detection and 
identification of the above microbial species or genus or family or group, and (iii) the 
detection of antibiotic resistance genes, and (iv) the detection of toxin genes other 

10 than those listed in Annexes I to III and XXI may also be derived from the 
proprietary fragments or selected public databases sequences. For example, the 
oligonucleotide primers or probes may be shorter or longer than the ones we have 
chosen; they may also be selected anywhere else in the proprietary DNA fragments or 
in the sequences selected from public databases; they may be also variants of the 

15 same oligonucleotide. If the target DNA or a variant thereof hybridizes to a given 
oligonucleotide, or if the target DNA or a variant thereof can be amplified by a given 
oligonucleotide PCR primer pair, the converse is also true; a given target DNA may 
hybridize to a variant oligonucleotide probe or be amplified by a variant 
oligonucleotide PCR primer. Alternatively, the oligonucleotides may be designed 

20 from any DNA fragment sequences for use in amplification methods other than PCR. 
Consequently, the core of this invention is the identification of universal, species- 
specific, genus-specific, resistance gene-specific, toxin gene-specific genomic or non- 
genomic DNA fragments which are used as a source of specific and ubiquitous 
oligonucleotide probes and/or amplification primers. Although the selection and 

25 evaluation of oligonucleotides suitable for diagnostic purposes requires much effort, 
it is quite possible for the individual skilled in the art to derive, from the selected 
DNA fragments, oligonucleotides other than the ones listed in Annexes I to III and 
XXI which are suitable for diagnostic purposes. When a proprietary fragment or a 
public databases sequence is selected for its specificity and ubiquity, it increases the 

30 probability that subsets thereof will also be specific and ubiquitous. 

Since a high percentage of clinical specimens are negative for bacteria (Table 
3), DNA fragments having a high potential for the selection of universal 
oligonucleotide probes or primers were selected from proprietary and public 
databases sequences. The amplification primers were selected from genes highly 

35 conserved in bacteria, fungi and parasites, and are used to detect the presence of any 
bacterial or fungal or parasitical pathogen in clinical specimens in order to determine 
rapidly (less than one hour) whether it is positive or negative for bacteria, fungi or 
parasites. The selected genes, designated tuf, atpD and recA, encode respectively 2 
proteins (elongation factors Tu and G) involved in the translational process during 

40 protein synthesis, a protein (beta subunit) responsible for the catalytic activity of 
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proton pump ATPase and a protein responsible for the homologous recombination of 
genetic material. The alignments of tuf, atpD and recA sequences used to derive the 
universal primers include both proprietary and public databases sequences. The 
universal primer strategy allows the rapid screening of the numerous negative clinical 
5 specimens (around 80% of the specimens received, see Table 3) submitted for 
bacteriological testing. 

Table 4 provides a list of the bacterial, fungal and parasitical species for which 
tuf and/or atpD and/or recA sequences are revealed in the present invention. Tables 5 
and 6 provide a list of antibiotics resistance genes and toxin genes selected for 
10 diagnostic purposes. Table 7 provides the origin of tuf, atpD and recA sequences 
listed in the sequence listing. Tables 8-10 provide lists or species used to test 
specificity and ubiquity of some assays described in examples. 
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DETAILED DESCRIPTION OF THE INVENTION 

HIGHLY CONSERVED GENES AND THEIR USE TO GENERATE SPECIES- 
SPECIFIC, GENUS-SPECIFIC. FAMILY-SPECIFIC, GROUP-SPECIFIC AND 
5 UNIVERSAL NUCLEIC ACID PROBES AND AMPLIFICATION PRIMERS TO 
RAPIDLY DETECT AND IDENTIFY BACTERIAL. FUNGAL AND PARASITICAL 
PATHOGENS FROM CLINICAL SPECIMENS FOR DIAGNOSIS 

10 The present inventors compared the published Haemophilus influenzae and 

Mycoplasma genitalium genomes and searched for the most conserved genes, which 
would then serve, as paradigm to develop primers and probes. This sequence 
comparison is highly informative as these two bacteria are distantly related and most 
genes present in the minimal genome of M. genitalium are likely to be present in 

15 every bacterium. Therefore genes conserved between these two bacteria are likely to 
be conserved in all other bacteria. 

Following the genomic comparison, it was found that several protein-coding 
genes were conserved in evolution. Highly conserved proteins included the 
translation elongation factor Tu (EF-Tu) and the P subunit of F()Fl type ATP- 

20 synthase, and to a lesser extent, the RecA recombinase. 

Translation elongation factor Tu is a member of a family of GTP-binding 
proteins which- intervene in the interactions of tRNA molecules with the ribosome 
machinery during essential steps of protein synthesis. The role of elongation factor 
Tu is to facilitate the binding of aminoacylated tRNA molecules to the A site of the 

25 ribosome. The eukaryotic and archaebacterial homolog of EF-Tu is called elongation 
factor 1 alpha (EF-la). All protein synthesis factors originated from a common 
ancestor via gene duplications and fusions (Cousineau et al., 1997, J. Mol. Evol. 
45:661-670). In particular, elongation factor G (EF-G), although having a functional 
role in promoting the translocation of aminoacyl-tRNA molecules from the A site to 

30 the P site of the ribosome, shares sequence homologies with EF-Tu and is thought to 
have arisen from the duplication and fusion of an ancestor of the EF-Tu gene. 
In addition, EF-Tu is known to be the target for antibiotics belonging to the 
elfamycin's group as well as to other structural classes (Anborgh and Parmeggiani, 
1991, EMBO J. 10:779-784; Luiten et aL, 1992, European patent application serial 

35 No. EP 0 466 251 Al). EF-G for its part, is the target of the antibiotic fusidic acid. In 
addition to its crucial activities in translation, EF-Tu has chaperone-like functions in 
protein folding, protection against heat denaturation of proteins and interactions with 
unfolded proteins (Caldas et al, 1998, J. Biol. Chem 273:11478-11482). 
Interestingly, a form of the EF-Tu protein has been identified as a dominant 

40 component of the periplasm of Neisseria gonorrhoeae (Porcella et al., 1996, 
Microbiology 142:2481-2489), hence suggesting that at least in some bacterial 
species, EF-Tu might be an antigen with vaccine potential. 

FoFi type ATP-synthase belongs to a superfamily of proton-translocating 
ATPases divided in three major families: P, V and F (Nelson and Taiz, 
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1989, TIBS 14:113-116). P-ATPases (or E r E 2 type) operate via a phosphorylated 
intermediate and are not evolutionarily related to the other two families. V-ATPases 
(or VoVl type) are present on the vacuolar and other endomembranes of eukaryotes, 
on the plasma membrane of archaebacteria and also on the plasma membrane of some 
5 eubacteria especially species belonging to the order Spirochaetales as well as to the 
Chlamydiaceae and Deinococcaceae families. F-ATPases (or F()Fi type) are found 
on the plasma membrane of most eubacteria, on the inner membrane of mitochondria 
and on the thylakoid membrane of chloroplasts. They function mainly in ATP 
synthesis. They are large multimeric enzymes sharing numerous structural and 

10 functional features with the V-ATPases. F and V-type ATPases have diverged from a 
common ancestor in an event preceding the appearance of eukaryotes. The 0 subunit 
of the F-ATPases is the catalytic subunit and it possesses low but significant 
sequence homologies with the catalytic A subunit of V-ATPases. 

The translation elongation factors EF-Tu, EF-G and EF-lot, and the catalytic 

15 subunit of F or V-types ATP-synthase, are highly conserved proteins sometimes used 
for phylogenetic analysis and their genes are also known to be highly conserved 
(Iwabe et al. y 1989, Proc. Nad. Acad. Sci. USA 86:9355-9359, Gogarten et aU 1989, 
Proc. Natl. Acad. Sci. USA 86:6661-6665, Ludwig et al. 9 1993, Antonie van 
Leeuwenhoek 64:285-305). A recent BLAST (Altschul et al. 9 1997, J. Mol. Biol. 

20 215:403-410) search performed by the present inventors on the GenBank, EMBL, 
DDBJ and specific genome project databases indicated that throughout bacteria, the 
EF-Tu and the p subunit of FoFi type ATP-synthase genes may be more conserved 
than other genes that are well conserved between Hrinfluenzae and M. genitalium. 

The RecA recombinase is a multifunctional protein encoded by the recA gene. 

25 It plays a central role in homologous recombination, it is critical for the repair of 
DNA damage and it is involved in the regulation of the SOS system by promoting the 
proteolytic digestion the LexA repressor. It is highly conserved in bacteria and could 
serve as a useful genetic marker to reconstruct bacterial phylogeny (Miller and 
Kokjohn, 1990, Annu. Rev. Microbiol. 44:365-394). Althought recA possesses some 

30 highly conserved sequence segments that we used to design universal primers aimed 
at sequencing the recA fragments, it is clearly not as well conserved as tuf and atpD. 
Hence, recA may not be optimal for universal detection of bacteria with high 
sensitivity but it was chosen beacuse preliminary data indicated that to/ and atpD may 
sometimes be too closely related to find specific primer pairs that could discriminate 

35 between certain very closely related species and genera. While RecA, like tuf and 
atpD, possesses highly conserved regions suitable for the design of universal 
sequencing primers, the less conserved region between primers should be divergent 
enough to allow species -specific and genus-specific primers in those cases. 

Thus, as targets to design primers and probes for the genetic detection of 

40 microorganisms, the present inventors have focused on the genes encoding these 
three proteins: tuf, the gene for elongation factor Tu; atpD, the gene for (3 subunit of 
FoFi type ATP-synthase; and recA y the gene encoding the RecA recombinase. In 
several bacterial genomes tuf is often found in two highly similar duplicated copies 
named tufA and tufB (Filer and Furano, 1981, J. Bacteriol. 148:1006-1011, Sela et aL, 
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1989, J. Bacteriol. 171:581-584). In some particular cases, more divergent copies of 
the tuf genes can exist in some bacterial species such as some Actinomycetes (Luiten 
et al. European patent application publication No. EP 0 446 251 Al; Vijgenboom et 
al., 1994, Microbiology 140:983-998) and, as revealed as part of this invention, in 
several enterococcal species. In several bacterial species, tuf is organized in an 
operon with its homolog gene for the elongation factor G (EF-G) encoded by the 
fusA gene. This operon is often named the str operon. The tuf atpD and recA genes 
were chosen as they are well conserved in evolution and have highly conserved 
stretches as well as more variable segments. Moreover, these three genes have 
eukaryotic orthologs which are described in the present invention as targets to 
identify fungi and parasites. The eukaryotic homolog of elongation factor Tu is called 
elongation factor 1-alpha (EF-la) (gene name: tef tefl, efJ, ef-1 or EF-1). In fungi, 
the gene for EF-la occurs sometimes in two or more highly similar duplicated copies 
(often named tefl, tef2, tef3.„). In addition, eukaryotes have a copy of elongation 
factor Tu which is originating from their organelle genome ancestry (gene name: tufl, 
iufM or tujA). For the purpose of the current invention, the genes for these four 
functionally and evolutionarily linked elongation factors (bacterial EF-Tu and EF-G, 
eukaryotic EF-la, and organellar EF-Tu) will hereafter be designated as «tuf 
sequences». The eukaryotic (mitochondrial) F()Fi type ATP-synthase beta subunit 
gene is named atp2 in yeast. For the purpose of the current invention, the genes of 
catalytic subunit of either F or V-type ATP-synthase will hereafter be designated as 
«atpD sequences*. The eukaryotic homologs of RecA are distributed in two families, 
typified by the Rad51 and Dmcl proteins. For the purpose of the current invention, 
the genes corresponding to the latter proteins will hereafter be designated as «recA 
sequences». 

Analysis of multiple sequence alignments of tuf and atpD sequences permitted 
the design of oligonucleotide primers (and probes) capable of amplifying (or 
hybridizing to) segments of tuf and atpD genes from a wide variety of bacterial 
species (see Examples 1 to 4 and Table 7). Sequencing primer pairs for tuf sequences 
are listed in Annex I and hybridization probes are listed in Annex III. Sequencing 
primer pairs for atpD sequences are listed in Annex II. Analysis of the main 
subdivisions of tuf and atpD sequences (see Figures 1 and 2) permitted to design 
sequencing primers amplifying specifically each of these subdivisions. It should be 
noted that these sequencing primers could also be used as universal primers. 
However, since some of these sequencing primers include several variable sequence 
(degenerated) positions, their sensitivity could be lower than that of universal primers 
developed for diagnostic purposes. Further subdivisions could be done on the basis of 
the various phyla where these genes are encountered. 

Similarly, analysis of multiple sequence alignments of recA sequences present in 
the public databases permitted the design of oligonucleotide primers capable of 
amplifying segments of recA genes from a wide variety of bacterial species. 
Sequencing primer pairs for recA sequences are listed in Annex XXI. The main 
subdivisions of recA sequences comprise recA, rad51 and dmcl. Further subdivisions 
could be done on the basis of the various phyla where these genes are encountered. 
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The present inventor's strategy is to get as much sequence data information from 
the three conserved genes {tuf, atpD and recA). This ensemble of sequence data 
forming a repertory (with subrepertories corresponding to each target gene and their 
main sequence subdivisions) and then using the sequence information of the sequence 
repertory (or subrepertories) to design primer pairs that could permit either universal 
detection of bacteria or fungi or parasites, detection of a family or group of 
microorganism (e.g. Enterobacteriaceae), detection of a genus (e.g. Streptococcus) or 
finally a specific species (e.g. Staphylococcus aureus). It should be noted that for the 
purpose of the present invention a group of microorganisms is defined depending on 
the needs of the particular diagnostic test. It does not need to respect a particular 
taxonomical grouping or phylum. See example 12 where primers were designed to 
amplify a group a bacteria consisting of the 17 major bacterial species encountered as 
contaminants of platelet concentrates. Also remark that in that example, the primers' 
specificity is not perfect since the objective of that particular test is to be able to 
15 sensitively and rapidly detect at least the 17 important bacterial species, but the 
primers could also detect other species as well, as shown in Table 14. In these 
circumstances the primers shown in example 12 are considered universal for platelet- 
contaminating bacteria. To develop an assay specific for the latter, one or more 
primers or probes specific to each species could be designed. Another example of 
20 primers and/or probes for group detection is given by the Pseudomonad group 
primers. These primers were designed based upon alignment of tuf sequences from 
real Pseudomonas species as well as from former Pseudomonas species such as 
Stenotrophomonas maltophilia. The resulting primers are able to amplify all 
Pseudomonas species tested as well as several species belonging to different genera, 
25 hence as being specific for a group including Pseudomonas and other species, we 
defined that group as Pseudomonads as several members were former Pseudomonas. 

For certain applications, it may be possible to develop a universal, group, family 
or genus-specific reaction and to proceed to species identification using sequence 
information within the amplicon to design species-specific internal probes or primers 
30 or alternatively, to proceed direcdy by sequencing the amplicon. The various 
strategies will be discussed further below. 

The ensembles formed by public and proprietary tuf, atpD and recA sequences 
are used in a novel fashion so they constitute three databases containing useful 
information for the identification of microorganisms. 



35 



Oligonucleotide primers and probes design and synthesis 

The tuf, atpD and recA DNA fragments sequenced by us or selected from public 
databases (GenBank and EMBL) were used to design oligonucleotides primers and 
probes for diagnostic purposes. We also relied on the corresponding peptide sequence 
of tuf, atpD and recA sequences to facilitate the identification of regions suitable for 
primers and probes design. As part of the design rules, all oligonucleotides (probes 
for hybridization and primers for DNA amplification) were evaluated for their 
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suitability for hybridization or DNA amplification by polymerase chain reaction 
(PCR) by computer analysis using standard programs (i.e. the Genetics Computer 
Group (GCG) programs and the primer analysis software Oligo™ 5.0). The potential 
suitability of the PCR primer pairs was also evaluated prior to the synthesis by 
verifying the absence of unwanted features such as long stretches of one nucleotide 
and a high proportion of G or C residues at the 3' end (Persing et al. y 1993, 
Diagnostic Molecular Microbiology: Principles and Applications, American Society 
for Microbiology, Washington, D.C.). Oligonucleotide probes and amplification 
primers were synthesized using an automated DNA synthesizer (Perkin-Elmer Corp., 
Applied Biosystems Division). 

The oligonucleotide primers or probes may be derived from either strand of the 
duplex DNA. The primers or probes may consist of the bases A, G, C, or T or analogs 
and they may be degenerated at one or more chosen nucleotide position(s). The 
primers or probes may be of any suitable length and may be selected anywhere within 
the DNA sequences from proprietary fragments or from selected database sequences 
which are suitable for (i) the universal detection of bacteria or fungi or parasites, (ii) 
the species-specific detection and identification of any microorganism, including but 
not limited to: Candida albicans, Candida dubliniensis, Chlamydia pneumoniae, 
Chlamydia trachomatis, Cryptosporidium parvum, Enterococcus faecalis, 
Enterococcus faecium, Enterococcus gallinarum, Escherichia coli, Haemophilus 
influenzaer~ Neisseria gonorrhoeae, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus haemolyticus, Staphylococcus hominis, Staphylococcus 
saprophytics, Streptococcus agalactiae, Trypanosoma hruceU Trypanosoma cruzi, 
(iii) the genus-specific detection of Bordetella species, Candida species, Clostridium 
species, Corynebacterium species, Entamoeba species, Enterococcus species, 
Giardia species, Leishmania species, Staphylococcus species, Streptococcus species, 
Trypanosoma species, (iv) the family-specific detection of Enterobacteriaceae family 
members, Mycobacteriaceae family members, (v) the detection of Enterococcus 
casseliflavus-flavescens-gallinarum group, Trypanosomatidae family, Pseudomonads 
extended group, Platelet-contaminating bacteria group, or (vi) the detection of 
clinically important antibiotic resistance genes listed in Table 5, or (vii) the detection 
of clinically important toxin genes listed in Table 6. 

Variants for a given target bacterial gene are naturally occurring and are 
attributable to sequence variation within that gene during evolution (Watson et al., 
1987, Molecular Biology of the Gene, 4 th ed., The Benjamin/Cummings Publishing 
Company, Menlo Park, CA; Lewin, 1989, Genes IV, John Wiley & Sons, New York, 
NY). For example, different strains of the same bacterial species may have a single or 
more nucleotide variation(s) at the oligonucleotide hybridization site. The person 
skilled in the art is well aware of the existence of variant bacterial or fungal DNA 
sequences for a specific gene and that the frequency of sequence variations depends 
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on the selective pressure during evolution on a given gene product. The detection of a 
variant sequence for a region between two PCR primers may be demonstrated by 
sequencing the amplification product. In order to show the presence of sequence 
variants at the primer hybridization site, one has to amplify a larger DNA target with 
PCR primers outside that hybridization site. Sequencing of this larger fragment will 
allow the detection of sequence variation at this site. A similar strategy may be 
applied to show variants at the hybridization site of a probe. Insofar as the divergence 
of the target sequences or a part thereof does not affect the specificity and ubiquity of 
the amplification primers or probes, variant bacterial DNA is under the scope of this 
invention. Variants of the selected primers or probes may also be used to amplify or 
hybridize to a variant DNA. 
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Sequencing 
species 



of tuf sequences from a variety of hartPri al. fungal and parasitic! 



The nucleotide sequence of a portion of resequences was determined for a 
variety of bacterial, fungal and parasitical species. The amplification primers (SEQ 
~ 107 108 in Previous patent application WO98/20157), which amplify a 
tuf gene portion of approximately 890 bp, were used along with newly designed 
sequencing primer pairs (See Annex I for the sequencing primers for *u/ sequences) 
Most primer pairs can amplify different copies of tuf genes (tufA and tufB) This is 
not surprising since it is known that for several bacterial species these two genes are 
nearly identical. For example, the entire tufA and tufB genes from E coli differ at 
only 3 nucleotide positions (Neidhardt et ah, 1996, Escherichia coli and Salmonella- 
Cellular and Molecular Biology, 2 nd ed., American Society for Microbiology Press' 
Washington, D.C.). Similarly, some fungi are known to have two nearly identical 
copies of tuf sequences (EF-la). These amplification primers are degenerated at 
several nucleotide positions and contain inosines in order to allow the amplification 
of a wide range of tuf sequences. The strategy used to select these amplification 
primers is similar to that illustrated in Annex I for the selection of universal primers 
The ^/sequencing primers even sometimes amplified highly divergent copies of tuf 
genes (tufQ as illustrated in the case of some enterococcal species (SEQ ID NOs ■ 
73 75, 76 614 to 618, 621 and 987 to 989). To prove this we first had to clone PCR 
products before being able to sequence them Using the cloned sequence data we 
designed a new pair of sequencing primers specific to the divergent (tufC) cop V of 
enterococci (SEQ ID NOs, 658-659 and 661) and then sequenced directiy the Lc 
^ TlaJ^ a ^ liflC&tion P rfmers ( S EQ ID NOs, 543, 556, 557, 660, 664, 694, 
696 and 697) could be used to amplify the ru/sequences from any bacterial species 
The amplification pnmers (SEQ ID NOs.: 558, 559, 560, 653, 654, 655, 813 and 815) 
could be used to amplify the tuf (EF-la) genes from any fungal and parasitical 
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species. The amplification primers SEQ ID NOs. 1221-1228 could be used to amplify 
bacterial tuf sequences of the EF-G subdivision (fusA) (Figure 3). The amplification 
primer SEQ ID NOs 1224, and 1227-1229 could be used to amplify bacterial tuf 
sequences comprising the end of EF-G (jusA) and the beginning of EF-Tu (tuf), as 
5 shown in Figure 3. 

The tuf fragments to be sequenced were amplified using the following 
amplification protocol: One fil of cell suspension (or of purified genomic DNA 0.1- 
0.5 ng/jil) was transferred directly to 19 fil of a PCR reaction mixture. Each PCR 
reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 9.0), 0.1% Triton X-100, 2.5 
10 mM MgCl 2 , 1 fiM of each of the 2 primers, 200 fiM of each of the four dNTPs, 0.5 
unit of Tag DNA polymerase (Promega Corp., Madison, WI). PCR reactions were 
subjected to cycling using a PTC-200 thermal cycler (MJ Research Inc., Watertown, 
Mass.) as follows: 3 min at 96 °C followed by 30-45 cycles of 1 min at 95 °C for the 
denaturation step, 1 min at 50-55 °C for the annealing step and 1 min at 72 °C for the 
15 extension step. Subsequently, twenty microliters of the PCR-amplified mixture were 
resolved by electrophoresis in a 1.5% agarose gel. The amplificon was then 
visualized by staining with methylene blue (Flores et al., 1992, Biotechniques, 
13:203-205). The size of the amplification products was estimated by comparison 
with a 100-bp molecular weight ladder. The band corresponding to the specific 
20 amplification product was excised from the agarose gel and purified using the 
QIAquick™ gel extraction kit (QIAGEN Inc.^iChatsworth; CA). The gel-purified 
DNA fragment was then used directly in the sequencing protocol. Both strands of the 
tuf genes amplification product were sequenced by the dideoxynucleotide chain 
termination sequencing method by using an Applied Biosystems automated DNA 
25 sequencer (model 377) with their Big Dye™ Terminator Cycle Sequencing Ready 
Reaction Kit (Perkin-Elmer Corp., Applied Biosystems Division, Foster City, CA). 
The sequencing reactions were performed by using the same amplification primers 
and 10 ng/100 bp of the gel-purified amplicon per reaction. For the sequencing of 
long amplicons such as those of eukaryotic tuf (EF-la) sequences, we designed 
30 internal sequencing primers (SEQ ID NOs.: 654, 655 and 813) to be able to obtain 
sequence data on both strands for most of the fragment length. In order to ensure that 
the determined sequence did not contain errors attributable to the sequencing of PCR 
artefacts, we have sequenced two preparations of the gel-purified tuf amplification 
product originating from two independent PCR amplifications. For most target 
microbial species, the sequences determined for both amplicon preparations were 
identical. In case of discrepancies, a third independent PCR amplification was 
sequenced. Furthermore, the sequences of both strands were 100% complementary 
thereby confirming the high accuracy of the determined sequence. The ft*/ sequences 
determined using the above strategy are described in the Sequence Listing. Table 7 
gives the originating microbial species and the source for each tuf sequence in the 
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Sequence Listing. 

The alignment of the tuf sequences determined by us or selected from 
databases reveals clearly that the length of the sequenced portion of the tuf genes is 
variable. There may be insertions or deletions of several amino acids. In addition, in 
several fungi introns were observed. Intron sequences are part of tuf sequences and 
could be useful in the design of species-specific primers and probes. This explains 
why the size of the sequenced tuf amplification products was variable from one 
species to another. Consequently, the nucleotide positions indicated on top of each of 
Annexes IV to XXI, XXIII to XXXI, XXXVIII and XLII do not correspond for 
sequences having insertions or deletions. 

It should also be noted that the various tuf sequences determined by us 
occasionally contain degenerescences. These degenerated nucleotides correspond to 
sequence variations between tufA and tufB genes (or copies of the EF-G subdivision 
of tuf sequences, or copies of EF- 1 a subdivision of tuf sequences for fungi and 
parasites) because the amplification primers amplify both tuf genes. These nucleotide 
variations were not attributable to nucleotide misincorporations by the Taq DNA 
polymerase because the sequence of both strands was identical and also because the 
sequences determined with both preparations of the gel-purified tuf amplicons were 
identical. 

The selection of amplification primers from tuf sequences 

The tuf sequences determined by us or selected from public databases were used 
to select PCR primers for (i) the universal detection of bacteria, (ii) the genus-specific 
detection and identification of Enterococcus sp. and Staphylococcus sp. and (iii) the 
species-specific detection and identification of Candida albicans. The strategy used 
to select these PCR primers was based on the analysis of multiple sequence 
alignments of various tuf sequences. For more details about the selection of PCR 
primers from tuf sequences please refer to Examples and Annexes. 

Sequencing of atoD and recA sequences from a variety of bacterial, fungal and 
parasitical species 

The method use to obtain atpD and recA sequences is similar to that described 
above for tuf sequences. 

The selec tion of amplification primers from atvD or recA 



The comparison of the nucleotide sequence for the atpD or recA genes from 
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various bacterial fungal and parasitical species allowed the selection of PCR primers 
(refer to Examples 1, 2 and 6 and Annexes IV, V, X, XX, XXI). 

DNA amplification 

For DNA amplification by the widely used PCR (polymerase chain reaction) 
method, primer pairs were derived from proprietary DNA fragments or from database 
sequences. Prior to synthesis, the potential primer pairs were analyzed by using the 
Oligo™ 5.0 software to verify that they were good candidates for PCR amplification. 

During DNA amplification by PCR, two oligonucleotide primers binding 
respectively to each strand of the heat-denatured target DNA from the bacterial 
genome are used to amplify exponentially in vitro the target DNA by successive 
thermal cycles allowing denaturation of the DNA, annealing of the primers and 
synthesis of new targets at each cycle (Persing et aU 1993. Diagnostic Molecular 
Microbiology: Principles and Applications, American Society for Microbiology, 
Washington, D.C.). 

Briefly, the PCR protocols were as follows: Treated clinical specimens or 
standardized bacterial or fungal or parasitical suspensions (see below) or purified 
genomic DNA from bacteria, fungi or parasites were amplified in a 20 fil PCR 
reaction mixture. Each PCR reaction contained 50 mM KC1, 10 mM Tris-HCl (pH 
9.0), 2.5 mM MgCl 2 , 0.4 fiM of each primer, 200 fiM of each of the four dNTPs and 
0.5 unit of Taq DNA polymerase (Promega) combined with the TaqStart™ antibody 
(Clontech Laboratories Inc., Palo Alto, CA). The TaqStart™ antibody, which is a 
neutralizing monoclonal antibody to Taq DNA polymerase, was added to all PCR 
reactions to enhance the specificity and the sensitivity of the amplifications (Kellogg 
et a/., 1994, Biotechniques 16:1134-1137). The treatment of the clinical specimens 
varies with the type of specimen tested, since the composition and the sensitivity 
level required are different for each specimen type. It consists in a rapid protocol to 
lyse the bacterial cells and eliminate the PCR inhibitory effects. For amplification 
from bacterial or fungal cultures or from purified genomic DNA, the samples were 
added directly to the PCR amplification mixture without any pre-treatment step. An 
internal control was derived from sequences not found in the target microorganisms 
or in the human genome. The internal control was integrated into all amplification 
reactions to verify the efficiency of the PCR assays and to ensure that significant 
PCR inhibition was absent. Alternatively, an internal control derived from rRNA was 
also useful to monitor the efficiency of microbial lysis protocols. 

PCR reactions were then subjected to thermal cycling (3 min at 95 °C followed 
by 30 cycles of 1 second at 95 °C for the denaturation step and 30 seconds at 50-65 
°C for the annealing-extension step) using a PTC-200 thermal cycler (MJ Research 
Inc.). The number of cycles performed for the PCR assays varies according to the 
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sensitivity level required. For example, the sensitivity level required for microbial 
detection directly from clinical specimens is higher for blood specimens than for 
urine specimens because the concentration of microorganisms associated with a 
septicemia can be much lower than that associated with a urinary tract infection. 
5 Consequently, more sensitive PCR assays having more thermal cycles are required 
for direct detection from blood specimens. Similarly, PCR assays performed directly 
from bacterial or fungal or parasitical cultures may be less sensitive than PCR assays 
performed directly from clinical specimens because the number of target organisms is 
normally much lower in clinical specimens than in microbial cultures. 
10 The person skilled in the art of DNA amplification knows the existence of 

other rapid amplification procedures such as ligase chain reaction (LCR) 
transcription-mediated amplification (TMA), self-sustained sequence replication 
(3SR), nucleic acid sequence-based amplification (NASBA), strand displacement 
amplification (SDA), branched DNA (bDNA), cycling probe technology (CPT), solid 
15 phase amplification (SPA), rolling circle amplification technology (RCAT), and 
nuclease dependent signal amplification (NDSA) (Lee et al, 1997, Nucleic Acid 
Amplification Technologies: Application to Disease Diagnosis, Eaton Publishing 
Boston, MA; Persing et al, 1993, Diagnostic Molecular Microbiology: Principles and 
Applications, American Society for Microbiology, Washington, D.C.). The scope of 
20 this invention is not limited to the use of amplification by PCR, but rather includes 
the use of any rapid nucleic acid amplification method or any other procedure which 
may be used to increase rapidity and sensitivity of the nucleic acid-based tests The 
scope of the present invention -also covers the useof any nucleic acids amplification 
and detection technology including real-time or post-amplification detection 
25 technologies, any amplification technology combined with detection any 
hybridization nucleic acid chips or arrays technologies, any amplification chips or 
combination of amplification and hybridization chips technologies. Detection and 
identification by any sequencing method is also under the scope of the present 
invention. r 

Any oligonucleotide suitable for the amplification of nucleic acids by 
approaches other than PCR or for DNA hybridization and derived from the species- 
specific, genus-specific and universal DNA fragments as well as from selected 
antibiotic resistance or toxin gene sequences included in this document are also under 
the scope of this invention. 
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Detectio n of amplification products 

Classically, detection of amplification is performed by standard ethidium 
bromide-stained agarose gel electrophoresis. It is clear that other methods for the 
detection of specific amplification products, which may be faster and more practical 
for routine diagnosis, may be used. Such methods may be based on the detection of 
fluorescence after or during amplification. One simple method for monitoring 
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amplified DNA is to measure its rate of formation by measuring the^ increase in 
fluorescence of intercalating agents such as ethidium bromide or SYBR® Green I. If 
more specific detection is required, fluorescence-based technologies can monitor the 
appearance of a specific product during the reaction. The use of dual-labeled 
5 fluorogenic probes such as in the TaqMan™ system which utilizes the 5'-3' 
exonuclease activity of the Taq polymerase is a good example (Livak K.J. et al. 1995, 
PCR Methods Appl. 4:357-362). TaqMan™ can be performed during amplification 
and this "real-time" detection can be done in a single closed tube hence eliminating 
post-PCR sample handling and consequently preventing the risk of amplicon 
10 carryover (TaqMan™ system from Perkin Elmer or Amplisensor™ from Biotronics). 
Several other fluorescence-based detection methods can be performed in real-time. 
Fluorescence resonance energy transfer (FRET) is the principle behind the use of 
adjacent hybridization probes and molecular beacons. Adjacent hybridization probes 
are designed -to be internal to the amplification primers. The 3' end of one probe is 
15 labelled with a donor fluorophore while the 5' end of an adjacent probe is labelled 
with an acceptor fluorophore. When the two probes are specifically hybridized in 
closed proximity (spaced by 1 to 5 nucleotides) the donor fluorophore which has been 
excited by an external light source emits light that is absorbed by a second acceptor 
that emit more fluorescence and yields a FRET signal. Molecular beacons possess a 
20 stem-and-loop structure where the loop is the probe and at the bottom of the stem a 
-fluorescent moiety -is at one-end-wMle-a-quenGfiihglmoiety is at the other end. The 
beacons undergo a fluorogenic conformational change when they hybridize to their 
targets hence separating the fluorochrome from its quencher. The FRET principle is 
also used in an air thermal cycler with a built-in fluorometer (Wittwer, C.T. et al. 
25 1997. BioTechniques 22:130-138). The amplification and detection are extremely 
rapid as reactions are performed in capillaries : it takes 18 min to complete 45 cycles. 
Those techniques are suitable especially in the case where few pathogens are 
searched for. Boehringer-Roche Inc. sells the LightCycler™, an apparatus capable of 
rapid cycle PCR combined with fluorescent SYBR® Green I or FRET detection. We 
30 recently demonstrated in our laboratory, real-time detection of 10 CFU in less than 40 
minutes using adjacent hybridization probes on the LightCycler™. Methods based on 
the detection of fluorescence are particularly promising for utilization in routine 
diagnosis as they are very rapid, quantitative and can be automated. 

Microbial pathogens detection and identification may also be performed by solid 
35 support or liquid hybridization using species-specific internal DNA probes 
hybridizing to an amplification product. Such probes may be generated from any 
sequences from our repertory and designed to specifically hybridize to DNA 
amplification products which are objects of the present invention. Alternatively, the 
internal probes for species or genus or family or group detection and identification 
40 may be derived from the amplicons produced by a universal, family, group or genus 
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amplification assay. The oligonucleotide probes may be labeled with biotin or with 
digoxigenin or with any other reporter molecule (for more details see below the 
section on hybrid capture). Hybrization on a solid support is amendable to 
miniaturization. 

At present the oligonucleotide nucleic acid microarray technology is appealing. 
Currently, available low to medium density arrays (Heller et al., An integrated 
microelectronics hybridization system for genomic research and diagnostic 
applications. In: Harrison, D.J., and van den Berg, A., 1998, Micro total analysis 
systems 98, Kluwer Academic Publisher, Dordrecht.) could specifically capture 
fluorescent-labelled amplicons. Detection methods for hybridization are not limited 
to fluorescence; potentiometry, colorimetry and plasmon resonance are some 
examples of alternative detection methods. In addition to detection by hybridization, 
nucleic acid microarrays could be used to perform rapid sequencing by hybridization. 
Mass spectrometry could also be applicable for rapid identification of the amplicon or 
even for sequencing of the amplification products (Chiu and Cantor, 1999, Clinical 
Chemistry 45: 1578; Berkenkamp et al, 1998, Science 281:260). 

We also keep in mind the major challenge of molecular diagnostics tools, i.e. : 
integration of the major steps including sample preparation, genetic amplification, 
detection, data analysis and presentation (Anderson et al, Advances in integrated 
genetic analysis. In: Harrison, D.J., and van den Berg, A., 1998, Micro total analysis 
systems 98, Kluwer Academic Publisher, Dordrecht.). 

To ensure PCR efficiency, glycerol, dimethyl sulfoxide (DMSO) or other related 
solvents can be used to increase the sensitivity of the PCR and to overcome problems 
associated with the amplification of a target DNA having a high GC content or 
forming strong secondary structures (Dieffenbach and Dveksler, 1995, PCR Primer: 
A Laboratory Manual, Cold Spring Harbor Laboratory Press, Plainview, New York). 
The concentration ranges for glycerol and DMSO are 5-15% (v/v) and 3-10% (v/v), 
respectively. For the PCR reaction mixture, the concentration ranges for the 
amplification primers and MgCl 2 are 0.1-1.5 nM and 1.0-10.0 mM, respectively. 
Modifications of the standard PCR protocol using external and nested primers (i.e. 
nested PCR) or using more than one primer pair (i.e. multiplex PCR) may also be 
used (Persing et al, 1993, Diagnostic Molecular Microbiology: Principles and 
Applications, American Society for Microbiology, Washington, D.C.). For more 
details about the PCR protocols and amplicon detection methods, see Examples. 

Hybrid capture and chemiluminescence detection of amplifi cation products 

Hybridization and detection of amplicons by chemiluminescence were adapted 
from Nikiforov et al. (1994, PCR Methods and Applications 3:285-291 and 1995, 
Anal. Biochem. 227:201-209 ) and from the DIG™ system protocol of Boehringer 
Mannheim. Briefly, 50 [il of a 25 picomoles solution of capture probe diluted in EDC 
{l-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride} are immobilized in 
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each well of 96-wells plates (Microlite™ 2, Dynex) by incubation overnight at room 
temperature. The next day, the plates are incubated with a solution of 1% BSA 
diluted into TNTw (10 mM Tris-HCl, pH 7.5; 150 mM NaCl; 0.05% Tween™ 20) 
for 1 hour at 37 °C. The plates are then washed on a Wellwash Ascent™ 

5 (Labsystems) with TNTw followed by Washing Buffer (100 mM maleic acid pH7.5; 
150 mM NaCl; 0.3% Tween™ 20). 

The amplicons were labelled with DIG-ll-dUTP during PCR using the PCR 
DIG Labelling Mix from Boehringer Mannheim according to the manufacturer's 
instructions. Hybridization of the amplicons to the capture probes is performed in 

10 triplicate at stringent temperature (generally, probes are designed to allow hybrization 
at 55 °C, the stringent temperature) for 30 minutes in 1.5 M NaCl; 10 mM EDTA. It 
is followed by two washes in 2 X SSC; 0.1% SDS, then by four washes in 0.1 X SSC; 
0.1% SDS at the stringent temperature (55 °C). Detection with 1,2 dioxetane 
chemiluminescent alkaline phosphatase substrates like CSPD® (Tropix inc.) is 

15 performed according to the manufacturer's instructions but with shorter incubations 
times and a different antibody concentration. The plates are agitated at each step, the 
blocking incubation is performed for only 5 minutes, the anti-DIG-APl is used at a 
1:1000 dilution, the incubation with antibody lasts 15 minutes, the plates are washed 
twice for only 5 minutes. Finally, after a 2 minutes incubation into the detection 

20 buffer, the plates are incubated 5 min utes w ith CSPD® at room temperature followed 
by a 10 minutes incubation at 37 °C without agitation. Luminous signal detection is 
performed on a Dynex Microliter Plate Luminometer using RLU (Relative Light 
Units). 

25 Specificity and ubiquity tests for oligonucleotide primers and probes 

The specificity of oligonucleotide primers and probes was tested by 
amplification of DNA or by hybridization with bacterial or fungal or parasitical 
species selected from a panel comprising closely related species and species sharing 

30 the same anatomo-pathological site (see Annexes and Examples). All of the bacterial, 
fungal and parasitical species tested were likely to be pathogens associated with 
infections or potential contaminants which can be isolated from clinical specimens. 
Each target DNA could be released from bacterial cells using standard chemical 
and/or physical treatments to lyse the cells (Sambrook et al, 1989, Molecular 

35 Cloning: A Laboratory Manual, 2 nd ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY) or alternatively, genomic DNA purified with the GNOME™ 
DNA kit (BiolOl, Vista, CA) was used. Subsequently, the DNA was subjected to 
amplification with the primer pairs. 

Oligonucleotides primers found to amplify specifically the target species, genus, 

40 family or group were subsequently tested for their ubiquity by amplification (i.e. 
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ubiquitous primers amplified most or all isolates of the target species or genus or 
family or group). The specificity and ubiquity of the PCR assays using the selected 
amplification primer pairs were tested either directly from cultures of microbial 
species or from purified microbial genomic DNA. 

Probes were tested in hybrid capture assays as described above. An 
oligonucleotide probe was considered specific only when it hybridized solely to DNA 
from the species or genus or family or group from which it was selected. 
Oligonucleotide probes found to be specific were subsequently tested for their 
ubiquity (i.e. ubiquitous probes recognized most or all isolates of the target species or 
genus or family or group) by hybridization to microbial DNAs from different clinical 
isolates of the species or genus or family or group of interest including ATCC 
reference strains. Similarly, oligonucleotide primers and probes could be derived 
from antibiotic resistance or toxin genes which are objects of the present invention. 



15 Reference strains 

The reference strains used to built proprietary tuf, atpD and recA sequence data 
subrepertories, as well as to test the amplification and hybridization assays were 
obtained from (i) the American Type Culture Collection (ATCC), (ii) the Laboratoire 
20 de sante publique du Quebec (LSPQ), (Hi) the Centers for Disease Control and 
- Prevention (GDC), (iv)-the National-Culture Type Collection (NCTC) and (v) several 
other reference laboratories throughout the world. The identity of our reference 
strains was confirmed by phenotypic testing and reconfirmed by analysis of tuf, atpD 
and recA sequences (see example 13). 

25 

Antibiotic resistance genes 

Antimicrobial resistance complicates treatment and often leads to therapeutic 
failures. Furthermore, overuse of antibiotics inevitably leads to the emergence of 

30 bacterial resistance. Our goal is to provide clinicians, in approximately one hour, the 
needed information to prescribe optimal treatments. Besides the rapid identification 
of negative clinical specimens with DNA-based tests for universal bacterial detection 
and the identification of the presence of a specific pathogen in the positive specimens 
with species- and/or genus- and/or family- and/or group-specific DNA-based tests, 

35 clinicians also need timely information about the ability of the bacterial pathogen to 
resist antibiotic treatments. We feel that the most efficient strategy to evaluate rapidly 
bacterial resistance to antimicrobials is to detect directly from the clinical specimens 
the most common and clinically important antibiotic resistance genes (i.e. DNA- 
based tests for the detection of antibiotic resistance genes). Since the sequence from 

40 the most important and common bacterial antibiotic resistance genes are available 
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from public databases, our strategy is to use the sequence from a portion or from the 
entire resistance gene to design specific oligonucleotide primers or probes which will 
be used as a basis for the development of rapid DNA-based tests. The list of each of 
the bacterial antibiotic resistance genes selected on the basis of their clinical 
relevance (i.e. high incidence and importance) is given in Table 5. Our approach is 
unique because the antibiotic resistance genes detection and the bacterial detection 
and identification can be performed simultaneously in multiplex assays under 
uniform PCR amplification conditions. These amplifications can also be done 
separately. 

Toxin genes 

Toxin identification is often very important to prescribe optimal treatments. 
Besides the rapid identification of negative elinisal specimens with DNA-based tests 

15 for universal bacterial detection and the identification of the presence of a specific 
pathogen in the positive specimens with species- and/or genus- and/or family- and/or 
group-specific DNA-based tests, clinicians sometimes need timely information about 
the ability of certain bacterial pathogens to produce toxins. Since the sequence from 
the most important and common bacterial toxin genes are available from public 

20 databases, our strategy is to use the sequence from a portion or from the entire toxin 
-gene to design specific oligonucleotide primers -or probes which will be used as a 
basis for the development of rapid DNA-based tests. The list of each of the bacterial 
toxin genes selected on the basis of their clinical relevance (i.e. high incidence and 
importance) is given in Table 6. Our approach is unique because the toxin genes 

25 detection and the bacterial detection and identification can be performed 
simultaneously in multiplex assays under uniform PCR amplification conditions. 



Universal bacterial detection 

30 In the routine microbiology laboratory, a high percentage of clinical specimens 

sent for bacterial identification are negative by culture. Testing clinical samples with 
universal amplification primers or universal probes to detect the presence of bacteria 
prior to specific identification and screening out the numerous negative specimens is 
thus useful as it reduces costs and may rapidly orient the clinical management of the 

35 patients. Several amplification primers and probes were therefore synthesized from 
highly conserved portions of bacterial sequences from the tuf and atpD sequences. 
The universal primers selection was based on a multiple sequence alignment 
constructed with sequences from our repertory. 

All computer analysis of amino acid and nucleotide sequences were performed 

40 by using the GCG programs. Subsequently, optimal PCR primers for the universal 
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amplification of bacteria were selected with the help of the Oligo™ program. The 
selected primers are degenerated at several nucleotide positions and contain several 
inosines in order to allow the amplification of all clinically relevant bacterial species 
(Annex I). Inosine is a nucleotide analog able to specifically bind to any of the four 
nucleotides A, C, G or T. Degenerated oligonucleotides consist of an oligonucleotide 
mix having two or more of the four nucleotides A, C, G or T at the site of 
mismatches. The inclusion of inosine and/or of degenerescences in the amplification 
primers allow mismatch tolerance thereby permitting the amplification of a wider 
array of target nucleotide sequences (Dieffenbach and Dveksler, 1995 PCR Primer: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Plainview, NY). 

The amplification conditions with the universal primers are very similar to those 
used for the species- and genus-specific amplification assays except that the 
annealing temperature is slightly lower. The original universal PCR assay described 
in our co-pending WO98/20157 (SEQ ID NOs. 23-24 of the latter application) was 
specific and nearly ubiquitous for the detection of bacteria. The specificity for 
bacteria was verified by amplifying genomic DNA isolated from the 12 fungal 
species as well as genomic DNA from Leishmania donovani, Saccharomyces 
cerevisiae and human lymphocytes. None of the above eukaryotic DNA preparations 
could be amplified by the universal assay, thereby suggesting that this test is specific 
for bacteria. The ubiquity of the universal assay was verified by amplifying genomic 
DNAs from 116 reference strains which represent 95 of the most clinically relevant 
bacterial species. These species have been selected from the bacterial species listed in 
Table 4. We found that at least 104 of these species could be amplified. However, the 
assay could be improved since bacterial species which could not be amplified with 
the original tuf sequences-based assay included species belonging to the following 
genera: Corynebacterium (11 species) and Stenotrophomonas (1 species). Sequencing 
of the tuf genes from these bacterial species and others has been performed in the 
scope of the present invention in order to improve the universal assay. This 
sequencing data has been used to select new universal primers which may be more 
ubiquitous. Also, we improved our primer and probes design strategy by taking into 
consideration the phylogeny observed in analysing our repertory of tuf, atpD and 
recA sequences. Data from each of the 3 main subrepertories {tuf, atpD and recA) 
was subjected to a basic phylogenic analysis using the Pileup command from version 
10 of the GCG package (Genetics Computer Group, inc.). This analysis indicated the 
main branches or phyla reflecting the relationships between sequences. Instead of 
trying to design primers or probes able to hybridize to all phyla, we designed primers 
or probes able to hybridize to the main phyla while trying to use the largest phylum 
possible. This strategy should allow less degenerated primers hence improving 
sensitivity and by combining primers in a mutiplex assay, improve ubiquity. 
Universal primers SEQ ID NOs. 643-645 based on tuf sequences have been designed 
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to amplify most pathogenic bacteria except Actinomyceteae, Clostridiaceae and the 
Cytophaga, Flexibacter and Bacteroides phylum (pathogenic bacteria of this phylum 
include mostly Bacteroides, Porphyromonas and Prevotella s P* ci * s l^ m ™"™ 
these gaps have been designed for Actinomyceteae (SEQ ID NOs. 646-648), 
Clostridiaceae (SEQ ID NOs. 796-797, 808-811), and the Cytophaga, Flexibacter 
and Bacteroides phylum (SEQ ID NOs. 649-651). These primers sets could be used 
alone or in conjuction to render the universal assay more ubiquitous. These primers 
are in the process of being tested. 

Universal primers derived from atpD sequences include SEQ ID NOs. 562-565 
Combination of these primers does not amplify human DNA but should amplify 
almost all pathogenic bacterial species except proteobacteria belonging to the epsilon 
subdivision {Campylobacter and Helicobacter), the bacteria from the Cytophaga, 
Flexibacter and Bacteroides group and some actinomycetes and corynebacteria. By 
analysing atpD sequences from the latter species, primers and probes 
fill these gaps could be designed and used in conjuction with primers SEQ ID NOs. 
562-565 These primers are in the process of being tested. 

~ In addition, universality of the assay could be expanded by mixing atpD 
sequences-derived primers with tuf sequences-derived primers. Ultimately, even recA 
sequences-derived primers could be added to fill some gaps in the universal assay. 

It is important to note that the 95 bacterial species selected to test the ubiquity of 
the universal assay include all of the most clinically relevant bacterial species 
associated with a variety of human infections acquired in the community or in 
hospitals (nosocomial infections). The most clinically important bacterial and fungal 
pathogens are listed in Tables 1 and 2. 

a™;™ a Hd sentiences derived from tuf. atnD and recA sequences 

The amino acid sequences translated from the repertory of atpD, tuf and recA 
sequences are also an object of the present invention. The amino acid sequence data 

30 will be particularly useful for homology modeling of three-dimensional (3D) 
structure of the elongation factor Tu, elongation factor G, elongation factor la, 
ATPase subunit beta and RecA recombinase. For all these proteins, at least one 
structure model has been published using X-ray diffraction data from crystals. Based 
on those structural informations it is possible to use computer sofware to build 3D 

35 model structures for any other protein having peptide sequence homologies with the 
known structure (Greer, 1991, Methods in Enzymology, 202:239-252, Taylor, 1994, 
Sali 1995, Curr. Opin. Biotechnol. 6:437-451, Sanchez and Sali, 1997, Curr. Opin. 
Struct Biol 7:206-214, Fischer and Eisenberg, 1999, Curr. Opin. Struct. Biol. 9:208- 
211 Guex et al, 1999, Trends Biochem. Sci. 24: 364-367). Model structures of target 

40 proteins are used for the design or to predict the behavior of ligands and inhibitors 
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such as antibiotics. Since EF-Tu and EF-G are already known as antibiotic targets 
(see above) and since the beta subunit of ATPase and RecA recombinase are essential 
to the survival of the microbial cells in natural conditions of infection, all four 
proteins could be considered antibiotic targets. Sequence data, especially the new 
data generated by us could be very useful to assist the creation of new antibiotic 
molecules with desired spectrum of activity. In addition, model structures could be 
used to improve protein function for commercial purposes such as improving 
antibiotic production by microbial strains or increasing biomass. 



31 



CA 02307010 2000-0S-19 

BRIEF SUMMARY OF THE INVENTION 



Three highly conserved genes, encoding translation elongation factor Tu, the catalytic 
5 subunit of proton-translocating ATPase and the RecA recombinase, are used to 
generate species-specific, genus-specific, group-specific and universal nucleic acid 
probes and amplification primers to rapidly detect and identify bacterial, fungal and 
parasitical pathogens from clinical specimens for diagnosis. The concomittant 
detection of associated antibiotic resistance and toxin genes are also under the scope 
10 of the present invention. 



DESCRIPTION OF THE DRAWINGS 



15 



Figures 1 and 2 illustrate the principal subdivisions of the tuf and atpD sequences 
repertories, respectively. For the design of primers and probes, depending on the 
needs one may want to use the complete data set illustrated on the top of the pyramid 
or use only a subset illustrated by the different branching points. Smaller 
20 subdivisions, representing groups, families, genus and species, could even be made to 
extend to the bottom of the pyramid. Because the tuf and atpD sequences are highly 
conserved and evolved with each species, the design of primers and probes does not 
need to include all .the sequences within the database or its subdivisions. As 
illustrated, in Annexes IV to XX, depending on the use, sequences from a limited 
25 number of species can be carefully selected to represent: i) only the main 
phylogenetic branches from which the intended probes and primers need to be 
differentiating, and ii) only the species for which they need to be matching. However, 
for ubiquity purposes, and especially for primers and probes identifying large groups 
of species (genus, family, group or universal, or sequencing primers), the more data is 
30 included into the sequence analysis, the better the probes and primers will be suitable 
for each particular intended use. Similarly, for specificity purposes, a larger data set 
(or repertory) ensures optimal primers and probes design by reducing the chance of 
employing nonspecific oligonucleotides. 

Figure 3 illustrates the approach used to design specific amplification primers from 
35 fusA as well as from the region between the end of fusA and the beginning of tuf in 
the streptomycin (str) operon. 



EXAMPLES AND ANNEXES 

40 

The following examples and annexes are intended to be illustrative of the 
various methods and compounds of the invention, rather than limiting the scope 
thereof. 

32 



CA 02307010 2000-05-19 



The various annexes show the strategies used for the selection of DNA 
amplification primers, nucleic acid hybridization probes or molecular beacon internal 
probes from tuf atpD, recA, speA, stx, van, pbp, mecA, hexA, pep, ddl, mtl or 
5 unknown gene sequences: 

(i) Annex I shows the amplification primers used for tuf sequences. 

(ii) Annex II shows the amplification primers used for atpD sequences. 

(iii) Annex in shows the internal probes for nucleic acid hybridization and 
specific detection of tuf sequences. 

10 (iv) Annex IV illustrates the strategy used for the selection of the amplification 
primers specific for atpD sequences of the F-type. 

(v) Annex V illustrates the strategy used for the selection of the amplification 
primers specific for atpD sequences of the V-type. 

(vi) Annex VI illustrates the strategy used for the selection of the amplification 
15 primers specific for the tuf sequences of organelle lineage (M, the letter M is 

used to indicate that in most cases, the organelle is the mitochondria). 

(vii) Annex VII illustrates the strategy used for the selection of the amplification 
primers specific for the tuf sequences of eukaryotes (EF-1). 

(viii) Annex VIII illustrates the strategy for the selection of Streptococcus 
20 a galactiae-specific amplification primers from tuf sequences . 

Annex IX illustrates the strategy for the selection of Streptococcus 
agalactiae-specific hybridization probes from tuf sequences. 
Annex X illustrates the strategy for the selection of Streptococcus 
agalactiae-specific amplification primers from atpD sequences. 
25 (xi) Annex XI illustrates the strategy for the selection from tuf sequences of 
Candida albicans/dubliniensis-specific amplification primers, Candida 
albicans-specific hybridization probe and Candida dubliniensis-sp&cific 
hybridization probe. 

(xii) Annex XII illustrates the strategy for the selection of Staphylococcus- 
30 specific amplification primers from tuf sequences. 

(xiii) Annex XIII illustrates the strategy for the selection of the Staphylococcus 
genus-specific hybridization probe from tuf sequences. 

(xiv) Annex XIV illustrates the strategy for the selection of Staphylococcus 
saprophyticus-speeffic and Staphylococcus haemolyticus-specific 

35 hybridization probes from tuf sequences. 

(xv) Annex XV illustrates the strategy for the selection of Staphylococcus 
aureus-specific and Staphylococcus epidermidis-specific hybridization 
probes from tuf sequences. 

(xvi) Annex XVI illustrates the strategy for the selection of the Staphylococcus 
40 hominis-specific hybridization probe from tuf sequences. 
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(xvii) Annex XVII illustrates the strategy for the selection of the Enterococcus 
genus-specific amplification primers from tuf sequences. 

(xviii) Annex XVIII illustrates the strategy for the selection of the Enterococcus 
faecalis-spccmc hybridization probe, of the Enterococcus /aecium-specific 

5 hybridization probe and of the Enterococcus casseliflavus-flavescens- 

gallinarum group-specific hybridization probe from m/sequences. 

(xix) Annex XIX illustrates the strategy for the selection of primers from tuf 
sequences for me identification of platelets contaminants. 

(xx) Annex XX illustrates the strategy for the selection of the universal 
in amplification primers from atpD sequences. 

(xxi) Annex XXI illustrates the amplification primers used for nucleic acid 
amplification from recA sequences. 

(xxii) Annex XXH shows the specific and ubiquitous primers for nucleic acid 
amplification from speA sequences. 

15 (xxiii) Annex XXIII illustrates the first strategy for the selection of Streptococcus 
pyogenes-spccific amplification primers from speA sequences, 
fxxiv) Annex XXIV illustrates the second strategy for the selection of 
Streptococcus pyogenes-s^cifxo amplification primers from speA sequences. 

(xxv) Annex XXV illustrates the strategy for the selection of Streptococcus 
pwgenes-specific amplification primers from tuf sequences. 

(xxvi) Annex XXVI illustrates the strategy for the selection of Shiga- toxin- 
producing, Escherichia coZi-specific amplification primers and hybridization 
probe from stxj sequences. 

(xxvii) Annex XXVII illustrates the strategy for the selection of Shiga toxin- 
producing, Escherichia coZZ-specific amplification primers and hybridization 
probe from stx 2 sequences. 

(xxviii) Annex XXVIII illustrates the strategy for the selection of vanA-specific 
amplification primers from van sequences. 

(xxix) Annex XXIX illustrates the strategy for the selection of vanS-specific 
30 amplification primers from van sequences. 

(xxx) Annex XXX illustrates the strategy for the selection of vanC-specific 
amplification primers from vanC sequences. 

(xxxi) Annex XXXI illustrates the strategy for the selection of Streptococcus 
pneumoniae-specific amplification primers and hybridization probes from 

35 pbpla sequences. 

(xxxii) Annex XXXII shows the specific and ubiquitous primers for nucleic acid 
amplification from stx sequences. 

(xxxiii) Annex XXXIII shows the molecular beacon internal probes for hybridization 
and specific detection of toxin sequences. 

(xxxiv) Annex XXXIV shows the specific and ubiquitous primers for nucleic acid 
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amplification from van sequences. 

(xxxv) Annex XXXV shows the internal probes for nucleic acid hybridization and 
specific detection of van sequences. 

(xxxvi) Annex XXXVI shows the specific and ubiquitous primers for nucleic acid 
5 amplification from pbp sequences. 

(xxxvii) Annex XXXVII shows the internal probes for nucleic acid hybridization and 
specific detection of pbp sequences. 

(xxxviii) Annex XXXVIII illustrates the strategy for the selection of vanAB-specific 
amplification primers and specific hybridization probes vanA and vanB from 

10 van sequences. 

(xxxix) Annex XXXIX shows the internal probe for nucleic acid hybridization and 

specific detection of mecA. 
(xl) Annex XL shows the specific and ubiquitous primers for nucleic acid 

amplification from hexA sequences. 
15 (xli) Annex XLI shows the internal probe for nucleic acid hybridization and 

specific detection of hexA. 
(xlii) Annex XLII illustrates the strategy for the selection of Streptococcus 

pneumoniae species-specific amplification primers and hybridization probe 

from hexA sequences. 

20 (xliii) Annex XLHI shows the specific and ubiquitous primers for nucleic acid 
amplification from pep sequences, 
(xliv) Annex XLIV shows the specific and ubiquitous primers for nucleic acid 

amplification from unknown 5". saprophyticus gene sequences, 
(xlv) Annex XLV shows the molecular beacon internal probes for hybridization 
25 and specific detection of antibiotic resistance gene sequences. 

(xlvi) Annex XLVI shows the molecular beacon internal probe for hybridization 

and specific detection of an unknown S. aureus gene sequence, 
(xlvii) Annex XLVII shows the molecular beacon internal probe for hybridization 
and specific detection of tuf sequences. 
30 (xlviii) Annex XLVIII shows the molecular beacon internal probes for hybridization 
and specific detection of ddl and mil gene sequences, 
(xlix) Annex XLIX shows the internal probe for nucleic acid hybridization and 
specific detection of the unknown S. aureus gene. 

35 As shown in these annexes, the selected amplification primers may contain 

inosines and/or degenerescences. Inosine is a nucleotide analog able to specifically 
bind to any of the four nucleotides A, C, G or T. Alternatively, degenerated 
oligonucleotides which consist of an oligonucleotide mix having two or more of the 
four nucleotides A, C, G or T at the site of mismatches were used. The inclusion of 

40 inosine and/or of degenerescences in the amplification primers allows mismatch 
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tolerance thereby permitting the amplification of a wider array of target nucleotide 
sequences (Dieffenbach and Dveksler, 1995 PCR Primer: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, Plainview, New York). 



EXAMPLES 



EXAMPLE 1: 

Sequencing of bacterial atvD fF-tvoe and V-type) gene fragments. As shown in 
Annex IV, the comparison of publicly available atpD (F-type) sequences from a 
variety of bacterial species revealed conserved regions allowing the design of PCR 
primers able to amplify atpD sequences (F-type) from a wide range of bacterial 
species. Using primers pairs SEQ TD NOs. 566 and 567, 566 and 814, 568 and 567, 
570 and 567, 572 and 567, 569 and 567, 571 and 567, and 700 and 567, it was 
possible to amplify and sequence atpD sequences SEQ ID NOs. 242-270, 272-398, 
673-674, 737-767, 866-867 and 942-955. 

Similarly, Annex V shows how were designed the PCR primers able to amplify atpD 

sequences of the V-type from a wide range of bacterial species. Using primers SEQ 
jD NOs. 681-683, it was possible_to_amplify _and .sequence atpD sequences SEQ ID 
" Nos. 827-832"; 929-931, 95Tand 966. As the gene was difficult to amplify for several 

species, additional amplification primers were designed inside the original amplicon 

(SEQ ID NOs.1203-1207). 

EXAMPLE 2: 

Seq uencing of eukarvotic atpD (F-tvpe an d V-tvpe^) gene fragments. The comparison 
of publicly available atpD (F-type) sequences from a variety of fungal and parasitical 
species revealed conserved regions allowing the design of PCR primers able to 
amplify atpD sequences from a wide range of fungal and parasitical species. Using 
primers pairs SEQ ID NOs. 568 and 573, 574 and 573, 574 and 708, and 566 and 
567 it was possible to amplify and sequence atpD sequences SEQ ID NOs. 458-497, 
530-538, 663, 667, 676, 678-680, 768-778, 856-862, 889-896 and 941. 

In the same manner, the primers described in Annex V (SEQ ID NOs. 681-683)could 
amplify the atpD (V-type) gene from fungal and parasitical species. We were thus 
able to sequence SEQ ID Nos. 834-839, 956-957, and 959-965. 

EXAMPLE 3: 

Seq uencing of eukarvotic tuf (EF-1) pene fragments. As shown in Annex VII, the 
comparison of publicly available fw/(EF-l) sequences from a variety of fungal and 
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parasitical species revealed conserved regions allowing the design of PCR primers 
able to amplify tuf sequences from a wide range of fungal and parasitical species. 
Using primers pairs SEQ ID NOs. 558 and 559, 813 and 559, 558 and 815, 560 and 
559 653 and 559, 558 and 655, and 654 and 559, it was possible to amplify and 
sequence tuf sequences SEQ ID NOs. 399-457, 509-529, 622-624, 677, 779-790, 
840-842, 865, and 897-903. 



EXAMPLE 4: 



10 Sequencing of enkarvotic tuf (organelle origin, M) gene fragments. As shown in 
Annex VI, the comparison of publicly available tuf (organelle origin, M) sequences 
from a variety of fungal and parasitical organelles revealed conserved regions 
allowing the design of PCR primers able to amplify tuf sequences of several 
organelles belonging to a wide range fungal and parasitical species. Using primers 

15 pairs SEQ ID NOs. 664 and 652, 664 and 561, 911 and 914, 912 and 914, 913 and 
915 916 and 561, and 664 and 917, it was possible to amplify and sequence tuf 
sequences SEQ ID NOs. 498-508, 791-792, 843-855, and 904-910. 



EXAMPLE 5: 

20 

S pecific identification of Streptococcus analactia e using tuf sequences. As shown in 
Annex VIII, the comparison of tuf sequences from a variety of bacterial species 
allowed the selection of PCR primers specific for S. agalactiae. The strategy used to 
design the PCR primers was based on the analysis of a multiple sequence alignment 

25 of various tuf sequences. The multiple sequence alignment includes the tuf sequences 
of four bacterial strains from the target species as well as tuf sequences from other 
species and bacterial genera, especially representatives of closely related species. A 
careful analysis of this alignment allowed the selection of oligonucleotide sequences 
which are conserved within the target species but which discriminate sequences from 

30 other species and genera, especially from the closely related species, thereby 
permitting the species-specific and ubiquitous detection and identification of the 
target bacterial species. 

The chosen primer pair, oligos SEQ ID NO. 549 and SEQ ID NO. 550, gives an 
35 amplification product of 252 bp. Standard PCR was carried out using 0.4 fiM of each 
primer, 2.5 mM MgCl 2 , BSA 0.05 mM, IX Taq Buffer (Promega), dNTP 0.2 mM 
(Pharmacia), 1 fil Taq DNA polymerase (Promega) 0.025 U//il combined with 
TaqStart 5 ng/jil (Clontech Laboratories Inc., Palo Alto), 1 fil of genomic DNA 
sample in a final volume of 20 pi using a PTC-200 thermocycler (MJ Research Inc.). 
40 The optimal cycling conditions for maximum sensitivity and specificity were 3 
minutes at 95 °C for initial denaturation, then forty cycles of two steps consisting of 1 
second at 95 °C and 30 seconds at 62 °C, followed by terminal extension at 72 °C for 
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2 minutes. Amplification was monitored on agarose gel electrophoresis by staining 
the DNA with ethidium bromide. 

Specificity of the assay was tested by adding into the PCR reactions, 0.1 ng of 
genomic DNA from each of the bacterial species listed in Table 8 Strong 
amplification was observed only for the 5 S. agalactiae strains listed. Of the other 
bacterial species, including 32 species representative of the vaginal flora and 27 other 
streptococcal species, only S. acidominimus yielded amplification. The signal for 0.1 
ng of 5. acidominimus genomic DNA was weak and the detection limit for this 
species was 10 pg (corresponding to more than 4000 genome copies) while the 
detection limit for S. agalactiae was 2.5 fg (corresponding to one genome copy) of 
genomic DNA. 

To increase the specificity of the assay, internal probes were designed for FRET 
15 (Fluorescence Resonance Energy Transfer) detection using the LightCycler (Idaho 
Technology). As illustrated in Annex IX, a multiple sequence alignment of 
streptococcal ft*/ sequence fragments corresponding to the 252 bp region amplified by 
primers SEQ ID NO. 549 and SEQ ID NO. 550, was used for the design of internal 
probes TSagHF436 (SEQ ID NO. 582) and TSagHF465 (SEQ ID NO. 583). The 
20 region of the amplicon selected for internal probes contained sequences unique and 
specific to S. agalaFtiaeVSEQ ID NO. 583, the more specific probe, is labelled with 
fluorescein in 3', while SEQ ID NO. 582, the less discriminant probe, is labelled with 
CY5 in 5' and blocked in 3' with a phosphate group. However, since the FRET signal 
is only emitted if both probes are adjacently hybridized on the same target amplicon, 
25 detection is highly specific. 

Real-time detection of PCR products using the LightCycler™ was carried out using 
0 4 uM of each primer (SEQ ID NO. 549-550), 0.2 /iM of each probe (SEQ ID NO. 
582-583), 2.5 mM MgCl 2 , BSA 450 /xg/ml, IX PC2 Buffer (AB Peptides, St-Louis, 

30 MO), dNTP 0.2 mM (Pharmacia), KlenTaql™ DNA polymerase 0.5 U (AB 
Peptides) 0.025 U/pl combined with TaqStart (Clontech Laboratories Inc., Palo Alto), 
0 7 /il of genomic DNA sample in a final volume of 7 pi using a LightCycler 
thermocycler (Idaho Technology). The optimal cycling conditions for maximum 
sensitivity and specificity were 3 minutes at 94 °C for initial denaturation, then forty 

35 cycles of three steps consisting of 0 second (this setting meaning the LightCycler will 
reach the target temperature and stay at it for its minimal amount of time) at 94 °C, 
10 seconds at 64 °C, 20 seconds at 72 °C. Amplification was monitored during each 
annealing steps using the fluorescence ratio. The streptococcal species having close 
sequence homologies with the tuf sequence of S. agalactiae (S. acidominimus, S. 

40 anginosus, S. bovis, S. dysgalactiae, 5. equi, S. ferus, 5. gordonii, S. intermedins, S. 
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parasanguis, S. parauberis, S. salivartus, S. sanguis, S. suts, and of course S 
agalactiae) were tested in the LightCycler with 0.07 ng of genormc DNA per 
reaction. This time, only S. agalactiae yielded an amplification signal, hence 
demonstrating that the assay is species-specific. With the LightCycler assay using the 
5 internal FRET probes, the detection limit for 5. agalactiae was 12.5 fg 
(corresponding to five genome copies) of genomic DNA. 

EXAMPLE 6: 

10 * r ^«r identifie r™ nf Stmtococcm aPalactiae using atpD sequences . As shown 
in Annex XIV, the comparison of atpD sequences from a variety of bacterial species 
allowed the selection of PCR primers specific for S. agalactiae. The primer design 
strategy is similar to the strategy described in the preceding example except that atpD 
sequences were used in the alignment (see Annex X). 

15 Four primers were selected, ASag42 (SEQ ID NO. 627), ASag52 (SEQ ID NO. 628), 
ASag206 (SEQ ID NO. 625) and ASag371 (SEQ ID NO. 626). The following 
combinations of these four primers give four amplicons; SEQ ID NO. 62 <l ' + i SEQ ! ID 
NO 625 = 190 bp, SEQ ID NO. 628 + SEQ ID NO. 625 = 180 bp, SEQ ID NO. 627 

20 + SEQ ID NO. 626 = 355 bp, and SEQ ID NO. 628 + SEQ ID NO. 626 = 345 bp. 

Standard PCR was carried out on PTC-200 thermocyclers (MJ Research Inc) using 
04 uM of each primers pair, 2.5 mM MgCl 2 , BSA 0.05 mM, IX tag Buffer 
(Promega), dNTP 0.2 mM (Pharmacia), 1 fil tag DNA polymerase (Promega) 0.025 

25 U/ul combined with TaqStart 5 ng/^1 (Clontech Laboratories Inc., Palo Alto), 1 id of 
genomic DNA sample in a final volume of 20 ^1 using a PTC-200 thermocycler (MJ 
Research Inc.). The optimal cycling conditions for maximum sensitivity and 
specificity were adjusted for each primer pair. Three minutes at 95 °C for initial 
denaturation, then forty cycles of two steps consisting of 1 second at 95 C and 30 

30 seconds at the optimal temperature specified below were followed by termina 
extension at 72 °C for 2 minutes. Amplification was monitored on agarose gel 
electrophoresis by staining the DNA with ethidium bromide. Since atpD sequences 
are relatively more specific than tuf sequences, only the more closely related species 
namely, the steptococcal species listed in table 9, were tested. 

All four primer pairs only amplified the six S. agalactiae strains. With an annealing 
temperature of 63 °C, the primer pair SEQ ID NO. 627 + SEQ ID NO. 625 had a 
sensitivity of 1-5 fg (equivalent to 1-2 genome copies). At 55 °C, the primer pair SEQ 
ID NO 628 + SEQ ID NO. 625 had a sensitivity of 2.5 fg (equivalent to 1 genome 
40 copy). At 60 °C, the primer pair SEQ ID NO. 627 + SEQ ID NO. 626 had a 
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sensitivity of 10 fg (equivalent to 4 genome copies). At 58 °C, the pnmer pmr SEQ 
D NO «8 + SEQ ID NO. 626 had a sensitivity of 2.5-5 fg (equ.valent to 1-2 
genome copies). This proves that all fonr primer pairs can detec, S. agalacnae wrth 
Specificity and sensitivity. Togemer with example 5, this example demonstrates 
5 mat bo* "/ and a, P D sequences are suitab.e target for me idenuf.cat.on of 
microorganisms at the species level. 

EXAMPLE 7: 

10 QevelQEB^t ~f a PCR — for Secti on and identification of staphylococci at 
gpTins and species levels. 

Materials and Methods 

,5 Bacterial strains. The specificity of the PCR assay was verified by using a panel of 
ATCC (America Type Culture Collection) and DSMZ (Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH ; German Collection of Microorganisms 
and Cell Cultures) reference strains consisting of 33 gram-negative and 47 gram- 
positive bacterial species (Table 12). In addition, 295 clinical isolates ^nting U 
20 different species of staphylococci from the microbiology laborato ry o ^ C«ta 
Hospitaller Universitaire de Quebec, Pavilion Centre Hospitaller de- 1 Universite 
Laval (CHUL) (Ste-Foy, Quebec, Canada) were also tested to further validate the 
Staphylococcus-*^** PCR assay. These strains were all identified by using (i) 
conventional methods or (ii) the automated MicroScan Autoscan-4 system equipped 
25 with the Positive BP Combo Panel Type 6 (Dade Diagnostics, Mississauga, Ontario, 
Canada). Bacterial strains from frozen stocks kept at -80 JC in brain heart infusion 
(BHI) broth containing 10% glycerol were cultured on sheep blood agar or m BHI 
broth (Quelab Laboratories Inc, Montreal, Quebec, Canada). 

30 PCR primers and internal probes. Based on multiple sequence alignments, regions 
of the tuf gene unique to staphylococci were identified. Staphylococcus genus-specific 
PCR primers^ TStaG422 (SEQ ID NO. 553) and TStaG765 (SEQ ID NO 575) were 
derived from these regions (Annex XII). These PCR primers are displaced by rwo 
nucleotide positions compared to original Staphylococcus genus-specific PCR ^primers 

35 described in previous patent application WO98/20157 (SEQ ID NOs. 17 and 20 in the 
said patent application). These modifications were done to ensure specificity and 
ubiquity of the primer pair, in the light of new tuf sequence data revealed in the present 
patent application for several additional streptococcal species and strains. 

40 Similarly, sequence alignments analysis were performed to design genus and species- 
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specific internal probes (see Annexes XIII to XVI). Two intenud probes for 
Staphylococcus-^* (SEQ ID NOs. 605-606), five for S aureus (SEQ ID NO. 584- 
588) five for 5. epidermidis (SEQ ID NO. 589-593), two for S. haemolyticus (SEQ ID 
NOs 594-595) three for S. hominis (SEQ ID NOs. 596-598), four for S. saprophytics 
5 (SEQ ID NOs 599-601 and 695), and two for coagulase-negative Staphylococcus 
' species S. epidermidis, S. hominis, S. saprophytics, S. auricularis ^capins S 
haemolyticus, S. lugdunensis, S. simulans, S. cohnii and 5. warnen (SEQ ID NOs. 
1175-1176) were designed. The range of mismatches between the Staphylococcus- 
specific 371-bp amplicon and each of the 20-mer species-specific internal probes was 
10 from 1 to 5, in the middle of the probe when possible. No mismatches were present in 
the two Staphylococcuses probes for the 11 species analyzed : S aureus, 5. 
auricularis, S. capitis, 5. cohnii, S. epidermidis, S. haemolyticus, S. homims, S. 
lugdunensis, S. saprophytics, S. simulans and S. warneri. In order to verify the intra- 
specific sequence conservation of the nucleotide sequence, sequences were obtained for 
15 the 371-bp amplicon from five unrelated ATCC and clinical strains for each of the 
species S. aureus, 5. epidermidis, S. haemolyticus. S. hominis and S. saprophytics. The 
Oligo™ (version 5.0) primer analysis software (National Biosciences, Plymouth 
Minn ) was used to confirm the absence of self-complementary regions within and 
between the primers or probes. When required, the primers contained inosines or 
20 degenerated nucleotides at one or more variable positions. Oligonucleotide pnmers and 
probes were synthesized on a model 394 DNA synthesizer (Perkin-Elmer Corp., 
Applied Biosystems Division, Mississauga, Ontario, Canada). Detection of the 
hybridization was performed with the DIG-labeled dUTP incorporated during the 
amplification with the Staphylococcus- specific PCR assay, and the hybridization signal 
25 was detected with a luminometer (Dynex Technologies) as described above in the 
section on luminescent detection of amplification products. Annexes XIII to XVI 
illustrate the strategy for the selection of several internal probes. 



30 



PCR amplification. For all bacterial species, amplification was performed from 
purified genomic DNA or from a bacterial suspension whose turbidity was adjusted to 
that of a 05 McFarland standard, which corresponds to approximately 1.5 x 10 
bacteria per ml. One nanogram of genomic DNA or 1 *U of the standardized bacterial 
suspension was transferred directly to a 19 ^1 PCR mixture. Each PCR reaction 
contained 50 mM KC1, 10 mM Tris-HCl ( P H 9.0), 0.1% Triton X-100, 2.5 mM MgCl 2 
0 2 |xM (each) of the two Staphylococcus genus-specific primers (SEQ ID NOs. 553 
and 575), 200 \xM (each) of the four deoxynucleoside triphosphates (Pharmacia 
Biotech), 3.3 ng/pil bovine serum albumin (BSA) (Sigma-Aldrich Canada Ltd, Oakville, 
Ontario, Canada), and 0.5 U Tag polymerase (Promega) coupled with ^ Start 
40 Antibody (Clontech). The PCR amplification was performed as follows : 3 mm. at 94 

41 
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SiSl — on'agLse ge, electrophoresis by staining the amplified 
DNA with ethidium bromide. 

Fnr determination of the sensitivities of the PCR assays, two-fold dilutions of 
purified genomic DNA were used to determine the minima, number of genome cop.es 
which can be detected. 

Results 

Amnlifications with the Staphylococcus genus-specific PCR assay. The specificity 
ofrf^Lssed by penning 30-cycie and 

with the oanel of gram-positive (47 species from s genera; and graurnegao^ w „ 
lcieTf^22 g^bacterial species Usted in Table 12. The PCR assay was able to 
27 < rf 27 staphylococcal species tested in bom 30-cycle and 40-cyc.e regmrens. 
Fo^VcycL PCR all bacterial species tested other than staphyloma*, were negahve. 
Fo 40-cycle PCR, Enurococcus faecalis, Lactobacillus acidophilus, ^ococcus 
tZisMacrococc*, caseolyUcus, Streptocuccus agaladae and S. mu,ans were shghtly 
" to me SropMoc/cc^-specific PCR assay. The other species tested remamed 
SL mirt tests performed on a collection of 295 clinical tsola.es provided by 
"he tncrobiotogy laboratory of me Centre Hospitalier Universale de Quebec, 
pTvito Centte HosptoUer de rUniversite Laval (CHUL), including Staphylococcus 
I^r ( „=34). S. aLularis (n=2), S. capitis (n=19), S. cohnii (n=5). S ep,derm,d* 
ZZ S Lmolyticus (n=21), S. hominis (n=73). S. lugdunens* (n=17) S. 
Lwphyncus (n=6), S. simulans (n=3), S. «ri (n=32) and S,aphylococcus sp. 
n^sV showed a uniform amplification signal with the 30-cycle PCR assays and a 
perfect relation between the genotype and classical identification schemes. 

The sensitivity of the SrapMococcas-specific assay with 30-cycle and 40-cyde PCR 
pro^oTwas determined by using purified genomic DNA from the 11 staphylococcal 
ScS prevTously mentioned. For PCR with 30 cycles, a detection limit of 50 cop.es of 
gTnomlc DNA was consistently obtained. In order to enhance the sensmvtfy of the 
Ssay the number of cycles was increased. For 40-cycle PCR assays, the detecton hm. 
wL l'oweLl to a r ange of 5-10 genome copies, depending on the staphylococcal 
species tested. 

Hybridization between the S<^*yfoc<.c«« -specific 371-bp amplicon and species- 
"ciflc or genus-specific internal probes. Inter-species polymorphism wa 
, sufficient to generate species-specific internal probes for each of the pnncpal 
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soecies involved in human diseases (S. aureus, S. epidermidis, S. haemolyticus, S. 
hominis and 5. saprophytics). In order to verify the intra-species sequence 
conservation of the nucleotide sequence, sequences comparisons were performed on 
the 371-bp amplicon from five unrelated ATCC and clinical strains for each of 5 
staphylococcal species: S. aureus, S. epidermidis, S. haemolyticus, S. hominis and S. 
saprophytics. Results showed a high level of conservation of nucleotide sequence 
between different unrelated strains from the same species. This sequence information 
allowed the development of staphylococcal species identification assays using 
species-specific internal probes hybridizing to the 371-bp amplicon. These assays are 
specific and ubiquitous for those five staphylococcal species. In addition to the 
species-specific internal probes, the genus-specific internals probes were able to 
recognize most Staphylococcus species. 



15 EXAMPLE 8: 

p^^^ p h.twe.e.n the t ^ Hn.elv related v^st species Candida albicans and 
rw,v/„ dubliniensis. It is often useful for the clinician to be able to differentiate 
between two very closely related species of microorganisms. Candida albicans is the 
20 most important cause of invasive human mycose. In the recent years, a very closely 
related species, Candida dubliniensis, was isolated in immunosuppressed patients. 
These two species are difficult to distinguish by classic biochemical methods. This 
example demonstrates the use of tuf sequences to differentiate Candida albicans and 
Candida dubliniensis. PCR primers TCal528 and TCal676 were selected for their 
25 ability to specifically amplify a tuf (elongation factor 1 alpha type) fragment from 
both species (see Annex XI for primer positions and previous patent application 
WO98/20157 for SEQ ID NOs. 11-12). Within this tuf fragment, a region 
differentiating C. albicans and C. dubliniensis by two nucleotides was selected and 
used to design two internal probes (see Annex XI for probe design, SEQ ID NOs. 577 
30 and 578) specific for each species. Amplification of genomic DNA from C. albicans 
and C. dubliniensis was carried out using DIG-ll-dUTP as described above in the 
section on chemiluminescent detection of amplification products. Internal probes 
SEQ ID NOs 577 and 578 were immobilized on the bottom of individual microtiter 
plates and hybridization was carried out as described above in the above section on 
35 luminescent detection of amplification products. Luminometer data showed that the 
amplicon from C. albicans hybridized only to probe SEQ ID NO. 577 while the 
amplicon from C dubliniensis hybridized only to probe SEQ ID NO. 578, thereby 
demonstrating that each probe was species-specific. 



40 EXAMPLE 9: 
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SS5 ftelocyclers (MI Research), the cycling conditions for - senary 

^««sU were 3 min. at 94 -C for initia. d-^^J^ 
,o of two steps consisting of 1 second at 95 °C and 30 seconds at 55 C, followed by 
10 JrTaTe— at 72 -C for 2 minutes. AmpUfication was motored o, ^agarose 

eel electrophoresis by staining the amplified DNA w.th etradium brom.de. The three 
could detect the equivalent of less than 200 * £~ 

Specificity was tested using 0.5 ng of purified genomic DNA &om » pane- o. 
,5 microorganisms including Babesia bovis, Babesia micro*. Canada albtcans, 

^tatate Leishmania major. Leishmania heriigi and Neospora canmum 
DNA could be amplified, thereby suggesting that the assay was 

species-specific. 

20 EXAMPLE 10: 

. irtrntifi -^- ^ Cfcfegaydie mehSOBtk- Upon analysis of m/ sequence 
S « possible ,o find two regions where Chlamydia 

remined conserved while other species have diverged. Primers Ctr82 (SEQ ID NO. 
25 "sTL CU249 (SEQ ID NO. 555) were designed. With the PTC-200 mermocyclers 
(MJ Research), the optimal cycling conditions for maximum sensitivity and 
specificity were determined to be 3 min. a. 94 °C for initial denamranor, then forty 
c^es of two steps consisting of 1 second at 95 -C and 30 seconds at 60 "C, foUowed 
by tTrminal extension a. 72 -C for 2 minutes. Amplification .was ; stored on 
30 agamse gel electtophoresis by sunning the amplified DNA wtm 

The assay could detect the equivalent of 8 C. trachomatis genome cop.es. Spectficity 
w^tesid on 0.1 ng of purified genomic DNA from a panel of rn.croorgan.sms 
toll g 22 species commonly encountered in the vaginal flora (Bacdlus subnhs. 
BactZL, Litis. Candida albicans. Clostridium difficile, Corynebactenum 
35 cTIZ Co^ebacterium urealyticun, Enterococcus faecalis. fnterococcus 
Zium. Escherichia coU. Fusobacterium nucleatum. Gardnerella vagmahs. 

**— * o^toca. Lactobacillus ~ 

niter Peptostreptococcus prevotii. Porphyromonas asaccharolyttca Prevotella 
Zaninogenica Propionibacterium acnes. Staphylococci* aureus. 
^idominL-s, and Streptococcus agalactiae). Only C. trachoma,, DNA could be 
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amplified, thereby suggesting that the assay was species-specific. 
EXAMPLE 11: 

5 GenuSzS Eggmc jd en^icato Of enterococci. Upon analysis of tuf sequence data and 
comparison with the repertory of tuf sequences, it was possible to find two regions 
where Enterococcus sequences remained conserved while other genera have diverged 
(Annex XVII). Primer pair Encg313dF and Encg599c (SEQ ID NOs. 1 137 and 1136) 
was tested for its specificity by using purified genomic DNA from a panel of bacteria 
10 listed in Table 10. Using the PTC-200 thermocycler (MJ Research), the optimal 
cycling conditions for maximum sensitivity and specificity were determined to be 3 
min at 94 °C for initial denaturation, then forty cycles of two steps consisting of 1 
second at 95 °C and 30 seconds at 55 °C, followed by terminal extension at 72 °C for 
2 minutes. Amplification was monitored on agarose gel electrophoresis by staining 
15 the amplified DNA with ethidium bromide. The 18 enterococcal species listed in 
Table 10 were all amplified efficiently. The only other species amplified were 
Abiotrophia adiacens, Gemella haemolysans and Gemella morbillorum, three gram- 
positive species. Sensitivity tested with several strains of E. casseliflavus, E faecium, 
E faecalis, E flavescens and £. gallinarum and with one strain of each other 
20 Enterococcus species listed in Table 10 ranged from 1 to 10 genome copies. The 
sequence variation within the 308-bp amplicon was sufficient so that nested PCR or 
internal probes could be used to speciate the amplicon and differenciate enterococci 
from Abiotrophia adiacens, Gemella haemolysans and Gemella morbillorum, thereby 
allowing to achieve excellent specificity. Species-specific internal probes were 
25 generated for each of the clinically important species, E. faecalis (SEQ ID NO. 
1174) E faecium (SEQ ID NO. 602), and the E. casseliflavus, E. flavescens, E. 
gallinarum group (SEQ ID NO. 1122) (Annex XVIII). The species-specific internal 
probes were able to differentiate their respective Enterococcus species from all other 
Enterococcus species. These assays are sensitive, specific and ubiquitous for those 
30 five Enterococcus species. 

EXAMPLE 12: 

identification of the maior bacterial platele ts c ontaminan ts using tuf sequences in a 
35 multi plex test. Blood platelets preparations need to be monitored for bacterial 
contaminations. The tuf sequences of 17 important bacterial contaminants of platelets 
were aligned. As shown in Annex XIX, analysis of these sequences allowed the 
design PCR primers. Since in the case of contamination of platelet concentrates, 
detecting all species (not just the more frequently encountered ones) is desirable, 
40 perfect specificity of primers was not an issue in the design. However, sensitivity is 
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important. That is why, to avoid having to put too much degeneracy only the most 
frequent contaminants were included in primer design, knowing that the selected 
primers would anyway be able to amplify more species than the 17 used in the 
Lien Oligonucleotide sequences which are conserved in these 17 major bacterial 
c—a^ts of platelet concentrates were chosen (oligos Tplaq 769 and Tplaq 991, 
respectively SEQ ID NOs. 636 and 637) thereby permitting the detection of these 
bacterial species. However, sensitiVity was slightly deficient with staphylococci. To 
ensure maximal sensitivity in the detection of all the more frequent bacterial 
contaminants, a multiplex assay also including oligonucleotide primers ^getting the 
Staphylococcus genera (oligos Stag 422, SEQ ID NO. 553; and Stag 765, SEQ ID 
NO 575) was developed. The bacterial species detected with the assay are listed in 
Table 14. 

The primer pairs, oligos SEQ ID NO. 636 and SEQ ID 

amplication product of 245 pb, and oligos SEQ ID NO. 553 and SEQ ID NO. 575 
thai give an amplification product of 368 pb, were used simultaneously in the 
multiplex PCR assay. Real-time detection of these PCR products was made on the 
LightCycler thermocycler (Idaho Technology) using SYBR Green I (Molecular 
Probe Inc ) SYBR® Green I is a fluorescent dye that binds specifically to double- 
stranded DNA. It thus bindJLto DNA products as they are synthesized. The measure 
of SYBR® Green I fluorescence at the end of each elongation cycle indicates the 
amount of DNA duplex generated by specific DNA fragment amplification and 
primer-dimer formation. 

Fluorogenic detection of PCR products with the LightCycler was carried out using 
1.0 mM of both Tplaq primers (SEQ ID NOs. 636-637) and 0.4 ^mM of both TStaG 
primers (SEQ ID NOs. 553 and 575), 2.5 mM MgCl 2 , BSA 500 mg/ml , dNTP 0 2 
mM (Pharmacia), 10X PCR reaction buffer (Boerhinger Mannheim) and tag DNA 
polymerase (Boerhinger Mannheim) 0.025 U/ml combined with TaqStart 5 ng/ml 
(Clontech), and 0.7 ml of genomic DNA sample in a final volume of 7 ml. The 
optimal cycling conditions for maximum sensitivity were 1 minute at 94 C for initial 
denaturation, then forty-five cycles of three steps consisting of 0 second at 95 C, 5 
seconds at 60 °C and 9 seconds at 72 °C. Amplification was monitored during each 
elongation cycle by measuring the level of SYBR® Green I. However, real analysis 
takes place after PCR. Melting curves are done for each sample and transformation of 
the melting peak allows determination of Tm. Thus primer-dimer and specific PCR 
product are discriminated. With this assay, all prominent bacterial contaminants of 
Platelet concentrates listed in Annex XIX and Table 14 were detected. SensiUvity 
tests were performed on the 9 most frequent bacterial contaminants of platelets. The 
i detection limit was less than 20 genome copies for E. cloacae, B. cereus, S. 
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choleraesuis and S. marcescens; less than 15 genome copies for P. aeruginosa; and 2 
to 3 copies were detected for 5. aureus, S. epidermidis, E. coli and K. pneumoniae. 
Further refinements of assay conditions should increase sensitivity levels. 

5 EXAMPLE 13: 

The resolving r CT of thf > "' f » nA atoD s ™" ences databases is comparable to the 
biochemical met res for bacterial identificati on. The present gold standard for 
bacterial identification is mainly based on key morphological traits and batteries of 
10 biochemical tests. Here we demonstrate that the use of tuf and atpD sequences 
combined with simple phylogenetic analysis of databases formed by these sequences 
is comparable to the gold standard. In the process of acquiring data for the tuf 
sequences, we sequenced the m/gene of a strain that was given to us labelled as 
Staphylococcus hominis ATCC 35982. That tuf sequence (SEQ ID NO. 192) was 
15 incorporated into the tuf sequences database and subjected to a basic phylogemc 
analysis using the Pileup command from version 10 of the GCG package (Genetics 
Computer Group, inc.). This analysis indicated that SEQ ID NO. 192 is not 
associated with other S. hominis strains but rather with the S. warneri strains^The 
ATCC 35982 strain was sent to the reference laboratory of the Laboratoire de Sante 
20 publique du Quebec (LSPQ). They used the classic identification scheme for 
" staphylococci (Kloos and Schleifer, 1975., J. Clin. Microbiol. 1:82-88). Their results 
shown that although the colonial morphology could correspond to S. hominis, the 
more precise biochemical assays did not. These assays included discriminant 
mannitol, mannose and ribose acidification tests as well as rapid and dense growth in 
25 deep thioglycolate agar. The LSPQ report identified strain ATCC 35982 as S 
wameri which confirms our database analysis. The same thing happened for S. 
warneri (SEQ ID NO. 187) which had initially been identified as S. haemolyticus by 
a routine clinical laboratory using a low resolving power automated system 
(MicroScan, AutoScan-4™). Again, the tuf and LSPQ analysis agreed on its 
30 identification as S. warneri. In numerous other instances, in the course of acquiring 
tuf and atpD sequence data from various species and genera, analysis of our tuf 
and/or atpD sequence databases permitted the exact identification of mislabelled or 
erroneously identified strains. These results clearly demonstrate the usefulness and 
the high resolving power of our sequence-based identification assays using the tuf 
35 and atpD sequences databases. 

EXAMPLE 14: 

Detection of group B strepto cocci in clinical specimens. 



40 
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Introduction 

Streptococcus agalactiae, the group B streptococcus (GBS), is responsible for a 
severe illness affecting neonate infants. The bacterium is passed from the healthy 

5 carrier mother to the baby during delivery. To prevent this infection, it is 
recommended to treat expectant mothers susceptible of carrying GBS m their 
anovaginal flora. Carrier status is often a transient condition and rigorous monitoring 
requires cultures and classic bacterial identification weeks before delivery. To 
improve the diagnostic and identification of GBS we developped a rapid, specific and 

10 sensitive PCR test fast enough to be performed right at delivery. 

Materials and Methods 



15 



20 



25 



GBS Clinical Specimens. A total of 66 duplicate anovaginal swabs were 
collected from 41 consenting pregnant women admitted for delivery at the Centre 
Hospitalier Universitaire de Quebec, Pavilion Saint-Francois d' Assise following the 
CDC recommendations. The samples were obtained either before or after rupture of 
membranes. The swab samples were tested at the Centre de Recherche en 
Infectiologie de l'Universite Laval within 24 hours of collection. Upon receipt, one 
swab was cut and then the tip of the swab was added to GNS selective broth for 
identification of -group B streptococci (GBS) by the standard culture methods 
recommended by the Center for Diseases Control. The other swab was processed 
following the instruction of the IDI DNA extraction kit (Infectio Diagnotics (IDI) 
Inc.) prior to PCR amplification. 

Oligonucleotides. PCR primers, Tsag340 (SEQ ID NO. 549) and Tsag552 
(SEQ ID NO. 550) complementary to the regions of the tuf gene unique for GBS 
were designed based upon a multiple sequence alignment using our repertory of tuf 
sequences. Oligo primer analysis software (version 5.0) (National Biosciences) was 
used to analyse primers annealing temperature, secondary structure potential as well 
30 as mispriming and dimerization potential. The primers were synthesized using a 
model 391 DNA synthesizer (Perkin-Elmer). 

A pair of fluorescently labeled adjacent hybridization probes Sag465-F (SEQ ID 
NO. 583) and Sag436-C (SEQ ID NO. 582) were synthesized and purified by Operon 
Technologies. They were designed to meet the recommendations of the manufacturer 
35 (Idaho Technology) and based upon multiple sequence alignment analysis using our 
repertory of tuf sequences to be specific and ubiquitous for GBS. These adjacent 
probes, which are separated by one nucleotide, allow fluorescence resonance energy 
transfer (FRET), generating an increased fluorescence signal when both hybridized 
simultaneously to their target sequences. The probe SEQ ID NO. 583 was labeled 
40 with FITC in 3 prime while SEQ ID NO. 582 was labeled with Cy5 in 5 prime. The 
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Cy5-labeled probes contained a 3'-blocking phosphate group to prevent extension of 
the probes during the PCR reactions. 

PCR Amplification. Conventional amplifications were performed either from 2 
III of a purified genomic DNA preparation or cell lysates of anovaginal specimens. 
5 The 20 fil PCR mixture contained 0.4 fiM of each GBS-specific primer (SEQ ID 
NOs. 549-550), 200 fiM of each deoxyribonucleotide (Pharmacia Biotech), 10 mM 
Tris-HCl (pH 9.0), 50 mM KC1, 0.1% Triton X-100, 2.5 mM MgCl 2 , 3.3 mg/ml 
bovine serum albumin (BSA) (Sigma), and 0.5 U of Tag polymerase (Promega) 
combined with the TaqStart antibody (Clontech). The TaqStart antibody, which is a 
10 neutralizing monoclonal antibody of Taq DNA polymerase, was added to all PCR 
reactions to enhance the efficiency of the amplification. The PCR mixtures were 
subjected to thermal cycling (3 min at 95 °C and then 40 cycles of 1 s at 95 °C, and 
30 s at 62 °C with a 2-min final extension at 72 °C) with a PTC-200 DNA Engine 
thermocycler (MJ research). The PCR-amplified reaction mixture was resolved by 
15 agarose gel electrophoresis. 

The LightCycler™ PCR amplifications were performed with 1 fil of the same 
preparation as described above. The lOpil amplification mixture consisted of 0.4 iiM 
each GBS-specific primer (SEQ ID NOs. 549-550), 200 /xM each dNTP, 0.2 iiM each 
fluorescently labeled probe (SEQ ID NOs. 582-583), 300 iig/ml BSA (Sigma), and 1 
20 fil of lOx PC2 buffer (containing 50 mM Tris-HCl (pH 9.1), 16 mM ammonium 
sulfate, 3.5 mM Mg 2+ , and 150 itg/ml BSA) and 0.5 U KlenTaql™ (AB Peptides) 
coupled with TaqStart™ antibody (Clontech). KlenTaql™ is a highly active and 
more heat-stable DNA polymerase without 5'-exonuclease activity. This prevents 
hydrolysis of hybridized probes by the 5' to 3' exonuclease activity. A volume of 7 fil 
25 of the PCR mixture was transferred into a composite capillary tube (Idaho 
Technology). The tubes were then centrifuged to move the reaction mixture to the 
tips of the capillaries and then cleaned with optical-grade methanol. Subsequently the 
capillaries were loaded into the carousel of a LC32 LightCycler™ (Idaho 
Technology), an instrument that combines rapid-cycle PCR with fluorescence 
30 analysis for continuous monitoring during amplification. The PCR reaction mixtures 
were subjected to a denaturation step at 94 °C for 3 min followed by 45 cycles of 0 s 
at 94 "C, 20 s at 64 °C and 10 s at 72 °C with a temperature transition rate of 20 °C/s. 
Fluorescence signals were obtained at each cycle by sequentially positioning each 
capillary on the carousel at the focus of optical elements affiliated to the built-in 
35 fluorimeter for 100 milliseconds. Complete amplification and analysis required about 
35 min. 

Specificity And Sensitivity Tests. The specificity of the conventional and 
LightCycler™ PCR assays was verified by using purified genomic DNA (0.1 
ng/reaction) from a battery of ATCC reference strains representing 35 clinically 
40 relevant gram-positive species (Abiotrophia defectiva ATCC 49176, Bifidobacterium 
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breve ATCC 15700, Clostridium difficile ATCC 9689, Corynebacterium urealyticum 
ATCC 43042, Enterococcus casseliflavus ATCC 25788, Enterococcus durans ATCC 
19432, Enterococcus faecalis ATCC 29212, Enterococcus faecium ATCC 19434, 
Enterococcus gallinarum ATCC 49573, Enterococcus raffinosus ATCC 49427, 
Lactobacillus reuteri ATCC 23273, Lactococcus lactis ATCC 19435, Listeria 
monocytogenes ATCC 15313, Peptococcus niger ATCC 27731, Peptostreptococcus 
anaerobius ATCC 27337, Peptostreptococcus prevotii ATCC 9321, Staphylococcus 
aureus ATCC 25923, Staphylococcus epidermidis ATCC 14990, Staphylococcus 
haemolyticus ATCC 29970, Staphylococcus saprophytics ATCC 15305, 
Streptococcus agalactiae ATCC 27591, Streptococci anginas ATCC 33397, 
Streptococcus bovis ATCC 33317, Streptococcus constellatus ATCC 27823, 
S^ptococcws dysgalactiae ATCC 43078, Streptococcus gordonii ATCC 10558, 
Streptococcus mitis ATCC 33399, Streptococcus mutans ATCC 25175, 

Streptococcus oralis Aiuv, ^5uJ/, bucpiOCvC^u* p~.r~*** ■ - 

Streptococci pneumoniae ATCC 6303, Streptococcus pyogenes ATCC 19615, 
Streptococcus salivarius ATCC 7073, Streptococcus sanguinis ATCC 10556, 
Streptococci ttoem ATCC 19436). These microbial species included 15 species of 
streptococci and many members of the normal vaginal and anal floras. In addition, 40 
GBS isolates of human origin, whose identification was confirmed by the Latex 
agglutination test (Streptex, Murex), were also used to evaluate the ubiquity of the 

assay. - • *' — 

For determination of the sensitivities (i.e., the minimal number of genome 
copies that could be detected) for conventional and LightCycler™ PCR assays, serial 
10-fold or 2-fold dilutions of purified genomic DNA from 5 GBS ATCC strains were 
used. 

Results 

Evaluation of the GBS-specific conventional and LightCycler™ PCR assay. 
The specificity of the two assays demonstrated that only DNAs from GBS strains 
could be amplified. Both PCR assays did not amplify DNAs from any other bacterial 
species tested including 14 streptococcal species other than GBS as well as 
phylogenetically related species belonging to the genuses Enterococcus, 
Peptostreptococcus and Lactococcus. Important members of the vaginal or anal flora, 
including coagulase-negative staphylococci, Lactobacillus sp., and Bacteriodes sp. 
were also negative with the GBS-specific PCR assay. The LightCycler™ PCR assays 
detected only GBS DNA by producing an increased fluorescence signal which was 
interpreted as a positive PCR result. Both PCR methods were able to amplify all of 
40 GBS clinical isolates, showing a perfect correlation with the phenotypic 
identification methods. 

The sensitivity of the assay was determined by using purified genomic DNA 
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from the 5 ATCC strains of GBS. The detection limit for all of these 5 strains was 
one genome copy of GBS. The detection limit of the assay with the LightCycler 
was 3.5 fg of genomic DNA (corresponding to 1-2 genome copies of GBS). These 
results confirmed the high sensitivity of our GBS-specific PCR assay. 

Direct Detection of GBS from anovaginal specimens. Among 66 anovaginal 
specimens tested, 12 were positive for GBS by culture. 11 of them were also 
identified by both PCR assays. The sensitivity of both PCR methods with 
vaginal/anal specimens for identifying colonization status in pregnant women at 
delivery was 91.7% when compared to culture results. The specificity and positive 
) predictive values were both 100% and the negative predictive value was 97.8%. The 
time for obtaining results was approximately 45 min for LightCycler PCR, 
approximately 100 min for conventional PCR and 48 hours for culture. 

Conclusion 

We have developed for the detection of GBS two PCR assays (conventional and 
LightCycler™) which are specific (i.e., no amplification of DNA from a variety of 
bacterial species other than GBS) and sensitive (i.e., able to detect around 1 genome 
copy for several reference ATCC strains of GBS). Both PCR assays are able to detect 
0 GBS directly from anovaginal specimens in a very short turnaround time. Using the 
real-time PCR assay on LightCycler™, we can detect GBS carriage in pregnant 
women at delivery within 45 minutes. 

EXAMPLE 15: 

'.5 - A 

Simultaneous identification of Streoto c nrr^ „ voge nes and its pyrogenic exotoxin A. 
The rapid detection of Streptococcus pyogenes and of its pyrogenic exotoxin A is of 
clinical importance. We developed a multiplex assay which permits the detection of 
strains of S. pyogenes carrying the pyrogenic toxin A gene, which is associated with 

30 scarlet fever and other pathologies. In order to specifically detect S. pyogenes, 
nucleotide sequences of the pyrroiidone carboxylyl peptidase gene (pep) were aligned 
to design PCR primers Spy291 (SEQ ID NO. 121 1) and Spy473 (SEQ ID NO. 1210). 
Next, we designed primers for the specific detection of the pyrogenic exotoxin A. 
Nucleotide sequences of the speA gene, carried on the bacteriophage T12, were 

35 aligned as shown in Annex XXIII to design PCR primers Spytx814 (SEQ ID NO. 
994) and Spytx 927 (SEQ ID NO. 995). 

The primer pairs : oligos SEQ ID NOs. 1210-1211, yielding an amplification product 
of 207 bp, and oligos SEQ ID NOs. 994-995, yielding an amplification product of 
40 135 bp, were used in the PCR assay. 
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PCR amplification was carried out using 0.4 jtiM of both pairs of primers, 2.5 mM 
MgCl 2 , BSA 0.05 fiM , dNTP 0.2 /iM (Pharmacia), 10X PCR reaction buffer 
(Promega), 0.025 U/ml Taq DNA polymerase (Promega) combined with TaqStart 
5 (Clontech Laboratories Inc.), and 1 ptl of genomic DNA sample in a final volume of 
20 ill. PCR amplification was performed using a PTC-200 thermal cycler (MJ 
Research). The optimal cycling conditions for maximum sensitivity were 3 minutes at 

94 °C for initial denaturation, then forty cycles of two steps consisting of 1 second at 

95 °C and 30 seconds at 63 °C, followed by a final step of 2 minutes at 72 °C. 
10 Detection of the PCR products was made by agarose gel (2%) electrophoresis 

containing 0.25 /ig/ml of ethidium bromide. Visualization of the PCR products was 
made under U V at 254 nm. 

The detection limit was less than 5 genome copies for both S. pyogenes and its 
15 pyrogenic exotoxin A. The assay was specific for pyrogenic exotoxin A-producing 5. 
pyogenes : strains of the 27 other species of Streptococcus tested were all negative, as 
well as 20 strains of various gram-positive and gram-negative bacterial species. 

A similar approach was used to design an alternative set of spM-specific primers 
20 (SEQ ID NOs. 996 to 998, see Annex XXIV). In addition^other set of primers 
based on the tuf gene (SEQ ID NOs: 999 to 1 00 1 , see Annex XXV) could be used to 
specifically detect Streptococcus pyogenes. 

EXAMPLE 16 : 

25 

Real-time identification of Shiga toxin-producing bacteria. Shiga toxin-producing 
Escherichia coli and Shigella dysenteriae cause bloody diarrhea. Currently, 
identification relies mainly on the phenotypic identification ofS. dysenteriae and E. 
coli serotype 0157:H7. However, other serotypes of E. coli are increasingly found to 

30 be producers of type 1 and/or type 2 Shiga toxins. Two pairs of PCR primers 
targeting highly conserved regions present in each of the Shiga toxin genes stxj and 
stx 2 were designed to amplify all variants of those genes (see Annexes XXVI and 
XXVII). The first primer pair, oligonucleotides 1SLT224 (SEQ ID NO. 1081) and 
1SLT385 (SEQ ID NO. 1080), yields an amplification product of 186 bp from the 

35 stxj gene. For this amplicon, the 1 SLTBl-Fam (SEQ ID NO. 1084) molecular beacon 
was designed for the specific detection of stxj using the fluorescent label 6-carboxy- 
fluorescein. A second pair of PCR primers, oligonucleotides 2SLT537 (SEQ ID NO. 
1078) and 2SLT678b (SEQ ID NO. 1079), yields an amplification product of 160 bp 
from the stx 2 gene. Molecular beacon 2SLTB1-Tet (SEQ ID NO. 1085) was designed 

40 for the specific detection of stx 2 using the fluorescent label 5-tetrachloro-fluorescein. 
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Both primer pairs were combined in a multiplex PCR assay. 

PCR amplification was carried out using 0.8 /xM of primers pair SEQ ID NOs. 1080- 
1081, 0.5 fiM of primer pair SEQ ID NOs. 1078-1079, 0.3 fiM of each molecular 
5 beacon, 8 mM MgCl 2 , 12.25 fig BSA, 0.2 mM dNTPs (Pharmacia), 50 mM Tris-HCl, 
16 mM NH4SO4, IX TaqMaster (Eppendorf), 2.5 U KlenTaql DNA polymerase (AB 
Peptides) combined with TaqStart (Clontech Laboratories Inc.), and 1 fil of genomic 
DNA sample in a final volume of 25 fi\. PCR amplification was performed using a 
SmartCycler thermal cycler (Cepheid). The optimal cycling conditions for maximum 
10 sensitivity were 60 seconds at 95 °C for initial denaturation, then 45 cycles of three 
steps consisting of 10 seconds at 95 °C, 15 seconds at 56 °C and 5 seconds at 72 °C. 
Detection of the PCR products was made in real-time by measuring the fluorescent 
signal emitted by the molecular beacon when it hybridizes to its target at the end of 
the annealing step at 56 °C. 

15 

The detection limit was the equivalent of less than 5 genome copies. The assay was 
specific for the detection of both toxins, as demonstrated by the perfect correlation 
between PCR results and the phenotypic characterization performed using antibodies 
specific for each Shiga toxin type. The assay was successfully performed on several 
20 Shiga toxin -producing strains isolated from various geographic areas of the world, 
including 10 0157:H7 E. coli, 5 non-0157:H7 E. coli and 4 5. dysenteriae. 

EXAMPLE 17: 

25 Development of a PCR assay for the detection and identification of staphylococci at 
genus and species levels and its associated mecA gene . The Staphylococcus genus- 
specific PCR primers described in Example 7 (SEQ ID NOs. 553 and 575) were used 
in multiplex with the mecA -specific PCR primers described in previous US patent 
serial no. 5,994,066 (SEQ ID NOs. 261 and 262 in the said patent) as well as with the 

30 5. awreMs-specific PCR primers SEQ ID NOs. 152 and 153 described in the said 
patent. Sequence alignment analysis of 10 publicly available mecA gene sequences 
were performed to design an internal probe for mecA (SEQ ID NO. 1177). An 
internal probe was also designed for the 5. a« raw-specific amplicon (SEQ ID NO 
1234). PCR amplification and agarose gel analysis of the amplified products were 

35 performed as described in Example 7, with the exception that 0.4 jtiM (each) of the 
two Staphylococcus genus-specific primers (SEQ ID NOs. 553 and 575) and 0.4 /xM 
(each) of the mecA gene primers and 0.4 /iM (each) of the S. awrews-specific primers 
were used in the PCR mixture. The specificity of the multiplex assay with 40-cycle 
PCR protocols was verified by using purified genomic DNA from five methicillin- 

40 resistant and fifteen methicillin-sensitive staphylococcal strains. The sensitivity of the 
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multiplex assay with 40-cycle PCR protocols was determined by using purified 
genomic DNA from five methicillin-resistant and seven methicillin-sensitive 
staphylococcal strains. A detection limit of 3-5 genome copies was obtained, 
depending on the staphylococcal species tested. Furthermore, the mecA -specific 
5 internal probe and the S. awrews-specific internal probe combined with (i) the species- 
specific internal probes, (ii) the genus-specific internal probes and (iii) the coagulase- 
negative staphylococci specific internal probes (described in Example 7) were able to 
recognize three methicillin-resistant staphylococcal strains with high sensitivity and 
specificity. 

10 The format of the assay is not limited to the one described above. A person skilled in 
the art could adapt the assay for different formats such as PCR with real-time 
detection using molecular beacon probes. Molecular beacon probes designed to be 
used in this assay include, but are not limited to, SEQ ID NO. 1232 for detection of 

the 5. attreus-speeifie amplieon, SEQ IB NO. 1233 for detection of ceagulase- 
15 negative staphylococci and SEQ ID NO. 1231 for detection of mecA. 

EXAMPLE 18: 

Sequencing of pbvla. pbp2b and pbp2x genes of Streptoccoccus pneumoniae. The 
20 comparison of publicly available pbpla, pbp2b and pbp2x sequences from a variety 
of S. pneumoniae strains revealed conserved regions allowing the design of PCR 
primers able to amplify pbpla, pbp2b, and pbp2x sequences of several strains of 5. 
pneumoniae having various levels of resistance to penicillin and third-generation 
cephalosporins. Using primer pairs Spnpbpla876 and Spnpbpla2163 (SEQ ID NOs. 
25 1125 and 1126), Spnpbp2b580 and Spnpbp2b2045 (SEQ ID NOs. 1142 and 1143), 
and Spnpbp2x469 and Spnpbp2x2212 (SEQ ID NOs. 1146 and 1147), it was possible 
to amplify and sequence pbpla sequences SEQ ID NOs. 1004-1018, pbp2b 
sequences SEQ ID NOs. 1019-1033, and pbp2x sequences SEQ ID NOs. 1034-1048. 
Six other PCR primers (SEQ ID NOs. 1 127-1 128, 1 144-1 145, 1148-1 149) were also 
30 designed and used to complete the sequencing of pbpla, pbp2b and pbp2x 
amplification products. 

EXAMPLE 19: 

35 

Sequencing of hexA genes of Streptococcus species . The hexA sequence of 5. 
pneumoniae described in previous US patent serial no. 5,994,066 (SEQ ID NO. 31 in 
the said patent, SEQ ID NO. 1 183 in the present application) allowed the design of a 
PCR primer (SEQ ID NO. 1 182) which was used with primer Spnl401 described in 
40 previous US patent serial no. 5,994,066 (SEQ TD NO. 156 in the said patent, SEQ ID 
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NO. 1179 in the present application) to amplify the hexA gene of one strain of S. 
oralis, three strains of S. mitts and four strains of S. pneumoniae (Annex XLII). Using 
primers SEQ ID NO. 1179 and SEQ ID NO. 1182, it was possible to amplify and 
sequence S. pneumoniae hexA (SEQ ID NOs. 1184-1187), S. mitis hexA (SEQ ID 
5 NOs. 1189-1 191) and 5. oralis hexA (SEQ ID NO. 1188). 



EXAMPLE 20: 

10 Development of a multiplex PCR assay for the detection of Streptococcus 
pneumoniae and its penicillin resistance genes . 

Material and Methods 

15 

Bacterial strains. The specificity of the multiplex PCR assay was verified by 
using a panel of ATCC (American Type Culture Collection) reference strains 
consisting of 33 gram-negative and 67 gram-positive bacterial species (Table 1 3). In 
addition, 55 clinical isolates of Streptococcus pneumoniae and 16 strains of S. mitis 

20 and 3 strains of S. oralis from thejnicrobiology laboratory of the Centre Hospitalier 
Universitaire de Quebec, Pavilion Centre Hospitalier de 1'Universite Laval (CHUL) 
(Ste-Foy, Quebec, Canada) and from the Laboratoire de sant6 publique du Quebec 
(LSPQ) (Sainte-Anne de Bellevue, Quebec, Canada) were also tested to further 
validate the Streptococcus pneumoniae-specific PCR assay. The penicillin MICs 

25 (minimal inhibitory concentration ) of 55 isolates were measured by the broth 
dilution method according to the recommended protocol of NCCLS. 

PCR primers and internal probes. The comparison of hexA sequences from a 
variety of streptococcal species described in Example 19 (SEQ ID NOs. 1184-1191) 

30 allowed the selection of a PCR primer specific for S. pneumoniae, Spnhexal613 
(SEQ ID NO. 1181). This primer was used with the S. pneumoniae species-specific 
primer SEQ ID NO. 1179 to generate an amplification product of 213 bp (Annex 
XLII). The PCR primer SEQ ID NO. 1 181 is located 87 nucleotides downstream on 
the hexA sequence compared to the original 5. pneumoniae species-specific PCR 

35 primer Spnl515 described in previous US patent serial no. 5,994,066 (SEQ ID NO. 
157 in the said patent). These modifications were done to ensure the design of the 5. 
pneumoniae species-specific internal probe according to the new hexA sequences of 
several close streptococcal species (SEQ ID NOs. 1184-1191). The comparison of 
pbpla sequences from 5. pneumoniae strains with various levels of penicillin 

40 resistance allowed the identification of amino acid substitutions Ile-459 to Met and 
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Ser-462 to Ala that occur in isolates with high-level penicillin resistance (MICs > 
l|ig/ml), and amino acid substitutions Ser-575 to Thr, Gln-576 to Gly and Phe-577 to 
Tyr that are common to all penicillin-resistant isolates with MICs > 0.25 fig/mL As 
shown in annex XXXI, PCR primer pair Spnpbplal365 and Spnpbplal747 (SEQ ID 
5 NOs. 1130 and 1131) were designed to detect high-level penicillin resistance (MICs 
> l|ig/ml), whereas PCR primer pair Spnpbplal591 (SEQ ID NO. 1 129) and SEQ ID 
NO. 1131 were designed to detect intermediate-level penicillin resistance (0,25 < 
MICs < 0.5|Lig/ml ). 

The comparison of hexA sequences allowed the design of an internal probe specific 
10 for S. pneumoniae (SEQ ID NO. 1180) (Annex XLII). The range of mismatches 
between the S. pneumoniae-specific 213-bp amplicon was from 2 to 5, in the middle 
of the 19-bp probe. Five internal probes containing all possible mutations were 
designed to detect the high-level penicillin resistance 3 83 -bp amplicon (SEQ ID NOs. 
1197, 1217-12-20) £nd five internal probes containing all possible mutations were 
15 designed to detect the 156-bp amplicon which includes both high-level and 
intermediate penicillin resistance (SEQ ID NOs. 1094, 1192, 1193, 1214 and 1216). 
Design and synthesis of primers and probes, and detection of the hybridization were 
performed as described in Example 7. Annex XXXI illustrates one of the internal 
probes for detection of the high-level penicillin resistance 383-bp amplicon (SEQ ID 
20 NO. 1197) and one of the internal probes for detection of the intermediate level 
penicillin resistance 156-bp amplicon (SEQ ID NO. 1193). 

PCR amplification. For all bacterial species, amplification was performed from 
purified genomic DNA. One fxl of 0.1 nanogram of genomic DNA was transferred 

25 directly to a 19 fil PCR mixture. Each PCR reaction contained 50 mM KC1; 10 mM 
Tris-HCl (pH 9.0); 0.1% Triton X-100; 2.5 mM MgCl 2 ; 0.1 fiM (each) of the S. 
pneumoniae species-specific primers SEQ ID NO. 1179 and SEQ ID NO. 1181, 0.1 
fiM of the penicillin resistance primer SEQ ID NO. 1129, 0.7 fiM of the other 
penicillin resistance primer SEQ ID NO. 1130, and 0.6 juM of the high-level 

30 penicillin resistance primer SEQ ID NO. 1131; 200 fiM (each) of the four 
deoxynucleoside triphosphates; 3.3 /ig//xl bovine serum albumin (BSA); and 0.5 U 
Taq polymerase coupled with 7a#Start™ Antibody. 

For determination of the sensitivities of the PCR assays, two-fold dilutions of 
35 purified genomic DNA were used to determine the minimal number of genome 
copies which can be detected. 

Results 

40 Amplifications with the multiplex PCR assay. The specificity of the assay was 
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assessed by performing 40-cycle PCR amplifications with the panel of gram-positive 
(67 species from 12 genera) and gram-negative (33 species from 17 genera) bacterial 
species listed in Table 13. All bacterial species tested other than S. pneumoniae were 
negative except S. mitis and S. oralis. Ubiquity tests were performed using a 
5 collection of 55 clinical S. pneumoniae isolates provided by the microbiology 
laboratory of the CHUL and the LSPQ, including high-level penicillin resistance 
(n=37), intermediate resistance (n=ll) and sensitive (n=7) strains. There was a 
perfect correlation between PCR and standard susceptibility testing for 37 isolates 
with high-level penicillin resistance and 7 penicillin-sensitive isolates. Among 11 
10 isolates with intermediate penicillin resistance, 9 had intermediate resistance based on 
PCR but two isolates with MIC of 0.5 |ng/ml showed a high-level penicillin resistance 
based on genotyping. This demonstrated that MIC of 0.5 lig/rnl may represent 
intermediate or high-level penicillin resistance. In general, there was a good 
correlation between the genotype and classical identification schemes. 

15 

The sensitivity of the S. pneumoniae-specific assay with 40-cycle PCR protocols 
was determined by using purified genomic DNA from 9 isolates of S. pneumoniae. 
The detection limit was 2-10 genome copies, depending on the pbp amplification 
pattern, 

20 

Hybridization between multiplex PCR amplicons and internal probes. The S. 

pneumoniae-specific internal probe did not hybridize to the S. mitis and S. oralis 
non-specific PCR amplification products generated from the S. pneunoniae-specific 
PCR primers. More precisely, this specificity was tested by using genomic DNA 
25 purified from 16 strains of S. mitis and from 3 strains of 5. oralis. The ubiquity of the 
assay was tested by using genomic DNA from 15 strains of S. pneumoniae. In 
summary, the combination of the PCR and hybridization assays results in a highly 
specific test for the detection of penicillin-resistant Streptococcus pneumoniae. 

30 EXAMPLE 21: 

Sequencing of the vancomycin resistance vanA. vanCL vanC2 and vanC3 genes . The 
publicly available sequences of the vanH-vanA-vanX-vanY locus of transposon 
Tnl546 from E. faecalis, vanCl sequence from one strain of E. gallinarum, vanC2 

35 and vanC3 sequences from a variety of E. casseliflavus and Enterococcus flavescens 
strains, respectively, allowed the design of PCR primers able to amplify the vanA, 
vanCl, vanC2 and vanC3 sequences of several Enterococcus species. Using primer 
pairs van6877 and van9106 (SEQ ID NOs. 1150 and 1155), vanCl-122 and vanCl- 
1315 (SEQ ID NOs. 1110 and 1109), and vanC2C3-l and vanC2C3-1064 (SEQ ID 

40 NOs. 1108 and 1107), it was possible to amplify and sequence vanA sequences 
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SEQ ID NOs. 1049-1057, vanCl sequences SEQ ID NOs. 1058-1059, vanC2 
sequences SEQ ID NOs. 1060-1063 and vanC3 sequences SEQ ID NOs. 1064-1066, 
respectively. Three other PCR primers (SEQ ID NOs. 1 1 5 1 -1 154) were also designed 
and used to complete the sequencing of vanA amplification products. 

5 

EXAMPLE 22: 

Development of a PCR assay for the detection and identification of enterococci at 
genus and species levels and its associated resistance genes vanA and vanB . The 

10 comparison of vanA and vanB sequences revealed conserved regions allowing the 
design of PCR primers specific for both vanA and vanB sequences (Annex 
XXXVIII). The PCR primer pair vanAB459 and vanAB830R (SEQ ID NOs. 1112 
and 1111) was used in multiplex with the Enterococcus genus-specific primers 
Encg3 1 3dF and EnGg599c (SEQ ID NOs, 1 1 37 and 1 1 36) described in Example 1 1 . 

15 Sequence alignment analysis of vanA and vanB sequences revealed regions suitable 
for the design of internal probes specific for vanA (SEQ ID NO. 1170) and vanB 
genes (SEQ ID NO. 1171). PCR amplification and agarose gel analysis of the 
amplified products were performed as described in Example 11. The specificity of the 
multiplex assay with 40-cycle PCR was verified by using 0.1 nanogram of purified 

20 genomic DNA from a panel of bacteria listed in Table 10. The sensitivity of the 
multiplex assay with 40-cycle PCR was verified with three strains of E. casseliflavus, 
eight strains of E, gallinarum, two strains of E. flavescens, two vancomycin-resistant 
strains of E. faecalis and one vancomycin-sensitive strain of E. faecalis, three 
vancomycin-resistant strains of E. faecium and one vancomycin-sensitive strain of E. 

25 faecium, and one strain of each of the other enterococcal species listed in Table 10. A 
detection limit of 1-10 genome copies was obtained, depending on the enterococcal 
species tested. In addition to the species-specific internal probes described in 
Example 11, the vanA- and vanfl-specific internal probes were able to recognize 
vancomycin-resistant enterococcal species with high sensitivity, specificity and 

30 ubiquity and with a perfect correlation between the genotypic and phenotypic 
analysis. 

The format of the assay is not limited to the one described above. A person skilled in 
the art could adapt the assay for different formats such as PCR with real-time 
detection using molecular beacon probes. Molecular beacon probes designed to be 
35 used in this assay include, but are not limited to, SEQ ID NO. 1 236 for the detection 
of E. faecalis, SEQ ID NO. 1235 for the detection of E. faecium, SEQ ID NO. 1240 
for the detection of vanA, and SEQ ID NO. 1241 for the detection of vanB. 



EXAMPLE 23: 

40 
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Development of a multiplex PCR assay for detection and identification of 
vancomycin-resistant Enterococcus faecalis, Enterococcus faecium, Enterococcus 
gallinarum* Enterococcus casseliflavus. and Enterococcus flavescens . The 
comparison of vanA and vanB sequences revealed conserved regions allowing design 
5 of a PCR primer pair (SEQ ID NOs. 1089 and 1090) specific for vanA sequences 
(Annex XXVIII) and a PCR primer pair (SEQ ID NOs. 1095 and 1096) specific for 
vanB sequences (Annex XXIX). The vanA-specific PCR primer pair (SEQ ID NOs. 
1089 and 1090) was used in multiplex with the vanB- specific PCR primer pair (SEQ 
ID NOs. 1095 and 1096). The comparison of vanCl, vanC2 and vanC3 sequences 

10 revealed conserved regions allowing design of PCR primers (SEQ ID NOs. 1101 and 
1102) able to generate a 158-bp amplicon specific for E. gallinarum, E. casseliflavus 
and E. flavescens (Annex XXX). The vonC-specific PCR primer pair (SEQ ID NOs. 
1101 and 1102) was used in multiplex with the E. faecalis species-specific PCR 
primer pair described in previous US patent serial no. 5,994,066 (SEQ ID NOs. 40 

15 and 41 in the said patent) and with the E. faecium species-specific PCR primer pair 
described in previous patent application WO98/20157 (SEQ ID NOs. 1 and 2 in the 
said application). For both multiplexes, the optimal cycling conditions for maximum 
sensitivity and specificity were found to be 3 min. at 94 °C, followed by forty cycles 
of two steps consisting of 1 second at 95 °C and 30 seconds at 58 °C, plus a terminal 

20 . extension at 72 °C for 2 minutes. Amplification was monitored on agarose gel 
electrophoresis by staining the amplified DNA with ethidium bromide. The vanA- 
specific PCR primer pair (SEQ ID NOs. 1089 and 1090), the vanfl-specific primers 
pair (SEQ ID NOs. 1095 and 1096) and the vanC-specific primer pair (SEQ ID NOs. 
1101 and 1102) were tested for their specificity by using 0.1 nanogram of purified 

25 genomic DNA from a panel of 5 vancomycin-sensitive Enterococcus species, 3 
vancomycin-resistant Enterococcus species, 13 other gram-positive bacteria and one 
gram-negative bacterium. Specificity tests using the E. faecium species-specific PCR 
primer pair described in previous patent application WO98/20157 (SEQ ID NOs. 1 
and 2 in the said application) and the E. faecalis species -specific PCR primer pair 

30 described in previous US patent serial no. 5,994,066 (SEQ ID NOs. 40 and 41 in the 
said patent) were performed on a panel of 37 gram-positive bacterial species. All 
Enterococcus strains were amplified with high specificity and there was a perfect 
correlation between the genotypic and phenotypic analysis. Finally, the sensitivity of 
the assays was determined for several strains of "E. gallinarum, E. casseliflavus, E. 

35 flavescens and vancomycin-resistant E. faecalis and E. faecium. Using each of the E. 
faecalis and E. faecium species-specific PCR primer pairs as well as vanA, vanB and 
wwC-specific PCR primers used alone or in multiplex as described above, the 
sensitivity ranged from 1 to 10 genome copies. 

The format of the assay is not limited to the one described above. A person skilled in 
40 the art could adapt the assay for different formats such as PCR with real-time 
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detection using molecular beacon probes. Molecular beacon probes designed to be 
used in this assay include, but are not limited to, SEQ ID NO. 1238 for the detection 
of E. faecalis, SEQ ID NO. 1237 for the detection of E. faecium, SEQ ID NO. 1239 
for the detection of vanA, and SEQ ID NO. 1241 for the detection of vanB. 

5 

EXAMPLE 24: 

Universal amplification involving the EF-G (fas A) subdivision of tuf sequences. As 
shown in Figure 3, primers SEQ ID NOs. 1228 and 1229 were designed to amplify 

10 the region between the end of fusA and the beginning of tuf genes in the str operon. 
Genomic DNAs from a panel of 35 strains were tested for PCR amplification with 
those primers. The following strains showed a positive result: Abiotrophia adiacens 
ATCC 49175, Abiotrophia defectiva ATCC 49176, Bacillus subtilis ATCC 27370, 
Closridium difficile ATCC 9689, Enterococcus avium ATCC 14025, Enterococcus 

15 casseiifiavus ATCC 25788, Enterococcus cecoram ATCC 43198, Enterococcus 
faecalis ATCC 29212, Enterococcus faecium ATCC 19434, Enterococcus flavescens 
ATCC 49996, Enterococcus gallinarum ATCC 49573, Enterococcus solitarius 
ATCC 49428, Escherchia coli ATCC 11775, Haemophilus influenzae ATCC 9006, 
Lactobacillus acidophilus ATCC 4356, Peptococcus niger ATCC 27731, Proteus 

20 mirabilis ATCC 25933, Staphylococcus aureus ATCC 43300, Staphylococcus 
auricularis ATCC 33753, Staphylococcus capitis ATCC 27840, Staphylococcus 
epidemidis_ ATCC 14990, Staphylococcus haemolyticus ATCC 29970, 
Staphylococcus homims ATCC 27844, Staphylococcus lugdunensis ATCC 43809, 
Staphylococcus saprophytics ATCC 15305, Staphylococcus simulans ATCC 27848, 

25 and Staphylococcus warned ATCC 27836. This primer pair could amplify additional 
bacterial species; however, there was no amplification for some species, suggesting 
that the PCR cycling conditions could be optimized or the primers modified. For 
example, SEQ ID NO. 1227 was designed to amplify a broader range of species. 
In addition to other possible primer combinations to amplify the region covering fusA 

30 and tuf, Figure 3 illustrates the positions of amplification primers SEQ ID NOs. 1221- 
1227 which could be used for universal amplification of Jits A segments. All of the 
above mentioned primers (SEQ ID NOs. 1221-1229) could be useful for the universal 
and/or the specific detection of bacteria. 

35 EXAMPLE 25: 

DNA fragment isolation from Staphylococcus saprophyticus bv arbitrarily primed 
PCR. DNA sequences of unknown coding potential for the species-specific detection 
and identification of Staphylococcus saprophyticus were obtained by the method of 
40 arbitrarily primed PCR (AP-PCR). 

AP-PCR is a method which can be used to generate specific DNA probes for 
microorganisms (Fani et al., 1993, Molecular Ecology 2:243-250). A description of 
the AP-PCR protocol used to isolate a species -specific genomic DNA fragment from 
Staphylococcus saprophyticus follows. Twenty different oligonucleotide primers of 

60 
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10 nucleotides in length (all included in the AP-PCR kit OPAD (Operon 
Technologies, Inc., Alameda, CA)) were tested systematically with DNAs from 5 
bacterial strains of Staphylococcus saprophyticus as well as with bacterial strains of 
27 other staphylococcal (non-5, saprophyticus) species. For all bacterial species, 
5 amplification was performed directly from one /zL (0.1 ng//xL) of purified genomic 
DNA. The 25 /zL PCR reaction mixture contained 50 mM KC1, 10 mM Tris-HCl (pH 
9.0), 0.1% Triton X-100, 2.5 mM MgCl. 2 , 1.2 fiM of only one of the 20 different AP- 
PCR primers OPAD, 200 fiM of each of the four dNTPs, 0.5 U of Tag DNA 
polymerase (Promega Corp., Madison, Wis.) combined with TaqStart™ antibody 

10 (Clontech Laboratories Inc., Palo Alto, CA). PCR reactions were subjected to cycling 
using a MJ Research PTC-200 thermal cycler as follows: 3 min at 96 °C followed by 
42 cycles of 1 min at 94 °C for the denaturation step, 1 mih at 3 1 °C for the annealing 
step and 2 min at 72 °C for the extension step. A final extension step of 7 min at 72 
°C followed the 42 cycles to ensure complete extension of PCR products. 

15 Subsequently, twenty microliters of the PCR-amplified mixture were resolved by 
electrophoresis on a 1 .5 % agarose gel containing 0.5 ptg/mL of ethidium bromide. 
The size of the amplification products was estimated by comparison with a 50-bp 
molecular weight ladder. 

Amplification patterns specific for Staphylococcus saprophyticus were observed with 
20 the AP-PCR primer OPAD- 16 (sequence: 5'-AACGGGCGTC-3'). Amplification with 
this primer consistently showed a band corresponding to a DNA fragment of 
approximately 380 bp for all Staphylococcus saprophyticus strains tested but not for 
any of the other staphylococcal species tested. 

The band corresponding to the 380 bp amplicon, specific and ubiquitous for 5. 

25 saprophyticus based on AP-PCR, was excised from the agarose gel and purified using 
the QIAquick™ gel extraction kit (QIAGEN Inc.). The gel-purified DNA fragment 
was cloned into the T/A cloning site of the pCR 2.1™ plasmid vector (Invitrogen 
Inc.) using T4 DNA ligase (New England BioLabs). Recombinant plasmid s were 
transformed into E. coli DH5a competent cells using standard procedures. All 

30 reactions were performed according to the manufacturer's instructions. Plasmid DNA 
isolation was done by the method of Birnboim and Doly (Nucleic Acid Res., 1979, 
7:1513-1523) for small-scale preparations. Ail plasmid DNA preparations were 
digested with the EcoRI restriction endonuclease to ensure the presence of the 
approximately 380 bp AP-PCR insert into the plasmid. Subsequently, a large-scale 

35 and highly purified plasmid DNA preparation was performed from two selected 
clones shown to carry the AP-PCR insert by using the QIAGEN plasmid purification 
kit (midi format). These large-scale plasmid preparations were used for automated 
DNA sequencing. 

The 380 bp nucleotide sequence was determined for three strains of 5. saprophyticus 
40 (SEQ ID NOs. 74, 1093, and 1198). Both strands of the AP-PCR insert from the two 
selected clones were sequenced by the dideoxynucleotide chain termination 
sequencing method with SP6 and T7 sequencing primers by using the Applied 
Biosystems automated DNA sequencer (model 373A) with their PRISM™ 
Sequenase R ™ Terminator Double-stranded DNA Sequencing Kit (Perkin-Elmer 
45 Corp., Applied Biosystems Division, Foster City, C A). 
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Optimal species-specific amplification primers (SEQ ID NOs. 1208 and 1209) have 
been selected from the sequenced AP-PCR Staphylococcus saprophytics DNA 
fragments with the help of the primer analysis software Oligo™ 4.0 (National 
Biosciences Inc.). The selected primers were tested in PCR assays to verify their 
5 specificity and ubiquity. Data obtained with DNA preparations from reference ATCC 
strains of 49 gram-positive and 31 gram-negative bacterial species, including 28 
different staphylococcal species, indicate that the selected primer pairs are specific 
for Staphylococcus saprophyticus since no amplification signal has been observed 
with DNAs from the other staphylococcal or bacterial species tested. 

10 

This invention has been described herein above, and it is readily apparent that 
modifications can be made thereto without departing from the spirit of this invention. 
These modifications are under the scope of this invention, as defined in the appended 

1 5 claims , 



20 



62 



CA 02307010 2000-05-19 



Tabl 1. Distribution (%) of n soc mial pathogens for various human Infections In USA 
(1990-1992) . 



Pathogen 


UTI 


a 

SSI 


4 

BSI 


Pneumonia 


CSF 5 


Escherichia coli 


27 


9 


5 


4 


2 


Staphylococcus aureus 


2 


21 


17 


21 


2 


Staphylococcus epidermidis 


2 


6 


20 


0 


1 


Enterococcus faecalis 


16 


12 


9 


2 


o 


Enterococcus faecium 


1 


1 


0 


0 


o 


Pseudomonas aeruginosa 


12 


9 


3 


18 


o 


Klebsiella pneumoniae 


7 


3 


4 


9 


0 


Proteus mirabilis 


5 


3 


1 


2 


n 


Streptococcus pneumoniae 


0 


0 


3 


1 


1ft 


Group B Streptococci 


1 


1 


2 




e 
o 


Other streptococci 


3 


5 


2 






Haemophilus influenzae 


0 


0 


0 


6 


*+o 


Neisseria meningitidis 


0 


0 


0 


o 




Listeria monocytogenes 


0 


0 


o 


n 


3 


Other enterococci 


1 


1 


0 


n _ 


0 


~Other staphylococci '™ 


2 


8 


13 


2 




Candida albicans 


9 


3 


5 


5 


0 


Other Candida 


2 


1 


3 


1 


0 


Enterobacter sp. 


5 


7 


4 


12 


2 


Acinetobacter sp. 


1 


1 


2 


4 


2 


Citrobacter sp. 


2 


1 


1 


1 


0 


Serratia marcescens 


1 


1 


1 


3 


1 
1 


Other Klebsiella 


1 


1 


1 


2 


Others 


0 


6 


4 


5 


0 



§™9i d ^^,5^^ su ' V9 " lance (NN ' S ' ,rom 60 hosp "* (Emorl and 

Urinary tract infection. 
Surgical site infection. 
Bloodstream infection. 
Cerebrospinal fluid. 
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Table 2. Distribution (%) of bloodstream infection path gens In Quebec (1995), Canada (1992) 
UK (1969-1988) and USA (1990-1992). 1 * 



Organism 


Quebec 1 


Canada 2 


UK 3 

Community- 
acquired 


Hospital- 
acquired 


USA 4 

Hospital- 
acquired 


E. coli 


15.6 


53.8 


OA ft 


on q 


5.0 


S. epidermidis and 












5 

other CoNS 


25.8 


- 


0.5 


7.2 


31.0 


S. aureus 


9.6 


- 


9.7 


19.4 


16.0 


S. pneumoniae 


6.3 


- 


22.5 


2.2 


- 


E. faecaUs 


3;G 


- 


1 .G 


4r2 


- 


£. faecium 


2.6 


- 


0.2 


0.5 


- 


Enterococcus sp. 


- 


- 


- 


- 


9.0 


H. influenzae 


1.5 


- 


3.4 


0.4 


_ 


P. aeruginosa 


1.5 


8.2 


1.0 


8.2 


3.0 


K. pneumoniae 


3.0 


11.2 


3.0 


9.2 


4.0 


P. mirabilis 




3.9 


2.8 


5.3 


1.0 


5. pyogenes 






1.9 


0.9 




Enterobacter sp. 


4.1 


5.5 


0.5 


2.3 


4.0 


Candida sp. 


8.5 






1.0 


8.0 


Others 


18.5 


17.4 


28.7 


18.9 


19.0 



EoXS^^ at the Centre Hosptta,ier de runiversrt6 Laval < CHUL > «™e a « 

Data from 10 hospitals throughout Canada representing 941 grarn-negatlve isolates. (Chamberland et al., 
1992, Clin. Infect. Dis., 15:615-628). 

Data from a 20-year study (1969-1988) for nearly 4000 isolates. (Eykyn et al., 1990, J. Antimicrob. 
Chemother., Suppl. C, 25:41-58). 

Data recorded by the National Nosocomial Infections Surveillance (NNIS) from 80 hospitals (Emori and 
Gaynes, 1993, Clin. Microbiol. Rev., 6:428-442). 
Coagulase-negative staphylococci. 
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Table 3. Distribution of positive and negative clinical specimens test d at the microbiol gy laboratory 
of the CHUL (February 1 994 - January 1 995). 



Clinical specimens 
and/or sites 


No. of samples 
tested (%) 


% of positive 
specimens 


% of negative 
specimens 


Urine 


17,981 (54.5) 


19.4 


80.6 


Blood culture/marrow 


10,010(30.4) 


6.9 


93.1 


Sputum 


1,266 (3.8) 


68.4 


31.6 


Superficial pus 


1,136 (3.5) 


72.3 


27.7 


Cerebrospinal fluid 


553 (1.7) 


1.0 


99.0 


Synovial fluid 


523 (1.6) 


2.7 


97.3 


Respiratory tract 


502(1.5) 


56.6 


43.4 


Deep pus 


473(1.4) 


56.8 


43.2 


Ears 


289 (0.9) 


47.1 


52.9 


Pleural and pericardial fluid 


132(0.4) 


1.0 


99.0 


Peritoneal fluid 


101(0.3) 


28.6 


71.4 


Total: 


32,966 (100.0) 


20.0 


80.0 



65 



CA 02307010 2000-05-19 



Tabl 4. Non-limitating xample of microbial species for which tut and/or atpD and/or recA sequences 
ar us d in th present invention. 

Bacterial species 



Achromobacter xylosoxidans subsp. denitrificans 
Acetobacterium woodi 
Aoetobacter aceti 

Acetobacter altoacetigenes 65 
1 0 Acetobacter polyoxogenes 
Acholeptasma laidlawii 
Acidiphilum facilis 
Acfnetobacter baumannii 

Acinetobacter calcoaceticus 70 
1 5 Acinetobacter Iwoffii 
Actinomyces meyeri 
Aerococcus viridans 
Aeromonas salmonicida 

Agrobacterium tumefaciens 75 
20 Atcaligenes faecalis 

Allochromatium vinosum 
Anabaena variabilis 
Anacystis nidulans 

Anaerorhabdus furcosus 80 
25 Aquifex aeolicus 
Aquifex pyrophiius 
Azotobacter vineiandii 
Bacillus anthracis 

Bacillus cereus 85 
30 Bacillus firmus 

Bacillus halodurans 
Bacillus megaterium 
Bacillus stearothermophilus 

Bacillus subtiiis 90 
35 Bacteroides distasonis 
Bacteroides fragilis 
Bacteroides ovatus 
Bacteroides vulgatus 

Bartonella henselae 95 
40 Bifidobacterium adolescentis 
Bifidobacterium breve 
Bifidobacterium dentium 
Bifidobacterium iongum 

Btastochloris viridis 100 
45 Borrelia burgdorferi 
Bordetella pertussis 
Bordetella bronchiseptica 
Branhameiia catarrhalis 

Brucella abortus 105 
50 Brevibacterium linens 
Brevibacterium flavum 
Buchnera aphidicola 
Burkholderia cepacia 

Burkholderia mallei 1 10 

55 Burkholderia pseudomatlei 
Campylobacter jejuni 
Cedecea davisae 
Cedecea lapagei 

Cedecea neteri 115 
60 Chlamydia pneumoniae 
Chlamydia psittaci 
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Chlamydia trachomatis 
Chlorobium vibrioforme 
Chforoflexus aurantiacus 
Chryseobacterium meningosepticum 
Citrobacter amalonaticus 
Citrobacter braakii 
Citrobacter farmeri 
Citrobacter freundii 
Citrobacter koseri 
Citrobacter sedlakii 
Citrobacter werkmanii 
Citrobacter youngae 
Clostridium acetobutylicum 
Clostridium beijerinckii 
Clostridium hUermentans 
Clostridium botulinum 
Clostridium difficile 
Clostridium innocuum 
Clostridium histolyticum 
Clostridium novyi 
Clostridium septicum 
Clostridium perfringens 
Clostridium ramosum 
Clostridium sordellii 
Clostridium tertium 
Clostridium tetani 
Comamonas acidovorans 
Corynebacterium bovis 
Corynebacterium cervicis 
Corynebacterium diphtheriae 
Corynebacterium flavescens 
Corynebacterium glutamicum 
Corynebacterium kutscheri 
Corynebacterium minutissimum 
Corynebacterium mycetoides 
Corynebacterium pseudodiphtheriticum 
Corynebacterium pseudogenitalium 
Corynebacterium pseudotuberculosis 
Corynebacterium renale 
Corynebacterium ulcerans 
Corynebacterium urealyticum 
Corynebacterium xerosis 
Coxiella burnetii 
Cytophaga lytica 
Deinococcus radiodurans 
Deinonema sp. 
Edwardsiella hoshinae 
Edwardsielta tarda 
Ehrlichia canis 
Ehrlichia risticii 
Eikenella corrodens 
Enterobacter aerogenes 
Enterobacter agglomerans 
Enterobacter amnigenus 
Enterobacter asburiae 
Enterobacter cancerogenus 
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Table 4. 



N n-limitating example of microbial species for which tut and/or atpD and/or recA sequences 
are used In the present Invention (continued). sequences 

Bacterial sp cles (continued) 



Enterobacter cloacae 

Enterobacter gergoviae 

Enterobacter hormaechei 

Enterobacter sakazakii 
1 0 Enterococcus a vium 

Enterococcus casse/rflavus 

Enterococcus cecorum 

Enterococcus columbae 

Enterococcus dispar 
15 Enterococcus durans 

Enterococcus faecalis 

Enterococcus faecium 

Enterococcus flavescens 

Enterococcus galJinarum 
20 Enterococcus hirae 

Enterococcus matodoratus 

Enterococcus mundtii 
Enterococcus pseudoavium 
Enterococcus raffinosus 
25 Enterococcus saccharolyticus 
Enterococcus solitarius 
Enterococcus suifureus 
Erwinia carotovora 
Escherichia coli 
30 Escherichia fergusonii 
Escherichia hermannii 
Escherichia vulneris 
Eubacterium tentum 
Eubacterium nodatum 
35 Ewingella americana 
Franciselta tularensis 
Frankia alni 

Fervidobacterium islandicum 

Fibrobacter succinogenes 
40 Flavobacterium ferrigeneum 

F/existipes sinusarabici 

Fusobacterium gonidiaformans 

Fusobacterium necrophorum subsp. necrophorum 

Fusobacterium nucleatum subsp. polymorphum 
45 Gardnerella vaginalis 

Gemella haemolysans 

Gemelia morbillorum 

Gloeobacter violaceus 

Gloeothece sp. 
50 Giuconobacter oxydans 

Haemophilus actinomycetemcomitans 
Haemophilus aphrophilus 
Haemophilus ducreyi 
Haemophilus haemolyticus 
55 Haemophilus influenzae 

Haemophilus parahaemolyticus 
Haemophilus parainfluenzae 
Haemophilus paraphrophilus 
Haemophilus segnis 
60 Hafnia alvei 

Haloarcula marismortui 
Halobacterium saJinarum 
Haloferax volcanii 
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Helicobacter pylori 
65 Herpetoshiphon aurantiacus 
Kingella kingae 
Klebsiella ornithinolytica 
Klebsiella oxytoca 
Klebsiella planticola 
70 Klebsiella pneumoniae subsp. ozaenae 

Klebsiella pneumoniae subsp. pneumoniae 
Klebsiella pneumoniae subsp. 
rhinoscleromatis 
Kluyvera ascorbata 
75 Kluyvera cryocrescens 
Kluyvera georgiana 
Lactobacillus acidophilus 
Lactobacillus garvieae 
Lactobacillus paracasei 
80 Lactobacillus casei subsp. casei 
Lactococcus lactis subsp. lactis 
Leclercia adecarboxylata 
Legionella micdadei 

Legionella pneumophila subsp. pneumophila 
85 Leminorella grimontii 
Leminorella richardii 
Leptospira bif/exa 
Leptospira interrogans 
Listeria monocytogenes 
90 Macrococcus caseoiyticus 

Magnetospirillum magnetotacticum 
Megamonas hypermegale 
Methanobacterium thermoautotrophicum 
Methanococcus jannaschii 
95 Methanococcus vannielii 
Methanosarcina barken 
Methanosarcina jannaschii 
Methylobacil/us flagellatum 
Methylomonas clara 
1 00 Micrococcus luteus 
Micrococcus lylae 
Mitsuokella multacida 
Mobiluncus curtisii subsp. holmesii 
Moellerella thermoacetica 
105 Moellerella wisconsensis 
Moraxella osloensis 
Morganella morganii subsp. morganii 
Mycobacterium avium 
Mycobacterium bovis 
1 10 Mycobacterium gordonae 
Mycobacterium leprae 
Mycobacterium tuberculosis 
Mycoplasma capricolum 
Mycoplasma gallisepticum 
115 Mycoplasma genitalium 
Mycoplasma hominfs 
Mycoplasma pirum 
Mycoplasma mycetoides 
Mycoplasma pneumoniae 
1 20 Mycoplasma pulmonis 
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Table 4. Non-limftating example of microbial species for which fuf and/or atpD and/or recA 
sequences are used in the present Invention (continued). 

Bact rial species (c ntlnued) 



Mycoplasma salivarium 

Myxococcus xanthus 

Neisseria animalis 

Neisseria canis 
10 Neisseria cinerea 

Neisseria cunicuii 

Neisseria eiongata subsp. elongata 

Neisseria eiongata subsp. intermedia 

Neisseria flava 
15 Neisseria fiavescens 

Neisseria gonorrhoeae 

Neisseria iactamica 

Neisseria meningitidis 

Neisseria mucosa 
20 Neisseria perflava 

Neisseria pharyngis 

Neisseriapotysaecharaa 

Neisseria sicca 

Neisseria subfiava 
25 Neisseria weaveri 

Ochrobactrum anthropi 

Pantoea aggfomerans 

Pantoea dispersa 

Paracoccus denitrifrcans 
30 Pasteureila muitocida 

Pectinatus frisingensis 

Peptococcus niger 

Peptostreptococcus anaerobius 

Peptostreptococcus asaccharoiyticus 
35 Peptostreptococcus prevotii 

Phormidium ectocarpi 

Pireilula marina 

Pianobispora rosea 

Plectonema boryanum 
40 Porphyromonas asaccharoiytica 

Porphyromonas gingfvaiis 

Pragla fontium 

Prevoteiia meianinogenica 

Prevoteiia oraiis 
45 Prevoteiia ruminocoia 

Prochlorothrix hoilandica 

Propionibacterium acnes 

Propionigenium modestum 

Proteus mirabilis 
50 Proteus penneri 

Proteus vulgaris 

Providencia afcalifaciens 

Providencia rettgeri 

Providencia rustigianii 
55 Providencia stuartii 

Pseudomonas aeruginosa 

Pseudomonas fiuorescens 

Pseudomonas stutzeri 

Psychrobacter phenylpyruvicus 
60 Rahnella aquatiiis 

Rickettsia prowazekii 

Rhodobacter capsulatus 

Rhodobacter sphaerolides 



Rhodospirillum rubrum 
65 Ruminococcus albus 
Salmonella bongori 

Salmonella choleraesuis subsp. arizonae 

Salmonella choleraesuis subsp choleraesuis 

Salmonella choleraesuis subsp. diarizonae 
70 Salmonella choleraesuis subsp. houtenae 

Salmonella choleraesuis subsp. indica 

Salmonella choleraesuis subsp. salamae 

Serpulina hyodysenteriae 

Serratia ficaria 
75 Serratia fonticola 

Serratia grimesii 

Serratia tiquefaciens 

Serratia marcescens 

Serratia odorifera 
88 Serratia plymuthica 

Serratia rubidaea 

Shewanella putida 

Shewanella putrefaciens 

Shigella boydii 
85 Shigella dysentehae 

Shigella flexneh 

Shigella sonnel 

Spirochaeta aurantia 

Staphylococcus aureus 
90 Staphylococcus aureus subsp. aureus 

Staphylococcus auriculahs 

Staphylococcus capitis subsp. capitis 

Staphylococcus cohnii 

Staphylococcus epidermidis 
95 Staphylococcus haemotyticus 

Staphylococcus hominis 

Staphylococcus lugdunensis 

Staphylococcus saprophytics 

Staphylococcus sciuri subsp. sciuri 
1 00 Staphylococcus simulans 

Staphylococcus wameri 

Stigmateila aurantiaca 

Stenotrophomonas maltophilia 

Streptococcus acidominimus 
1 05 Streptococcus agalactiae 

Streptococcus anginosus 

Streptococcus bovis 

Streptococcus cricetus 

Streptococcus cristatus 
1 10 Streptococcus downei 

Streptococcus dysgalactiae 

Streptococcus equi subsp. equi 

Streptococcus ferus 

Streptococcus gordonii 
115 Streptococcus macaca e 

Streptococcus mitis 

Streptococcus mutans 

Streptococcus oralis 

Streptococcus parasanguinis 

120 
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Table 4. Non-llmitating example of mlcr bial sp cies for which tuf and/ r atpD and/or recA 
sequ nc s are used in the present invent! n (continued). 

Bacterial species (continued) 



Streptococcus pneumoniae 
Streptococcus pyogenes 
Streptococcus ratti 
Streptococcus salivarius 
1 0 Streptococcus salivarius subsp. thermophilus 
Streptococcus sanguinis 
Streptococcus sobrinus 
Streptococcus suis 
Streptococcus uberis 
15 Streptococcus vestibularis 
Streptomyces anbofaciens 
Streptomyces aureofaciens 
Streptomyces cinnamoneus 
Streptomyces coelicofor 
20 Streptomyces collinus 
Streptomyces IMdans 
Streptomyces ramocissimus 
Streptomyces rimosus 

Streptomyces venezuelae 
25 Synechococcus sp. 

Synechocystis sp. 

Tatumetla ptyseos 

Taxeobacter occeaius 

Thermoplasma acidophilum 
30 Thermotoga maritima 

Thermus aquaticus 

fhermus thermophilus 

Thiobacillus ferrooxydans 

Thiomonas cuprina 
35 Trabulsiella guamensis 

Treponema pallidum 

Ureaplasma urealyticum 

Veillonella parvula 

Vibrio afginolyticus 
40 Vibrio anguiliarum 

Vibrio cholerae 

Wolinella succinogenes 

Xanthomonas citri 

Xanthomonas oryzae 
45 Xenorhabdus bovieni 

Xenorhabdus nematophilus 

Yersinia bercovieri 

Yersinia enterocolitica 

Yersinia frederikensii 
50 Yersinia intermedia 

Yersinia pestis 

Yersinia pseudotuberculosis 

Yersinia rohdei 

Yokenella regensburgei 
55 Zoogloea ramigera 

Fungal species 

Absidia corymbifera 
60 Absidia glauca 

Altemaria aftemata 
Arxula adeninivorans 



Aspergillus oryzae 
Aspergillus flavus 
65 Aspergillus fumigatus 
Aspergillus niger 
Aureobasidium pullulans 
Bipolaris hawaiiensis 
Blastoschizomyces capitatus 
70 Candida albicans 
Candida catenulata 
Candida dubliniensis 
Candida famata 
Candida glabrata 
75 Candida guilliermondii 
Candida haemulonii 
Candida inconspicua 
Candida kefyr 
Candida krusei 
80 Candida lambica 
Candida lusitaniae 
Candida norvegensis 
Candida parapsitosis 
Candida rugosa 
85 Candida sphaerica 
Candida tropicalis 
Candida utilis 
Candida viswanathii 
Candida zeylanoides 
90 Ciadophialophora carrionii 
Coccidioides immitis 
Coprinus cinereus 
Cryptococcus albidus 
Cryptococcus humicolus 
95 Cryptococcus neoformans 

Cunninghamella bertholletiae 
Curvularia lunata 
Emericella nidulans 
Exophiala jeansetmei 
100 Eremothecium gossypii 
Fonsecaea pedrosoi 
Fusarium oxysporum 
Geotrichum sp. 
Histoplasma capsulatum 
1 05 Issatchenkia orientalis 
Kluyveromyces lactis 
Malassezia furfur 
Malassezia pachydermatis 
Malbranchea filamentosa 
1 1 0 Metschnikowia pulcherrima 
Microsporum audouinii 
Mucor circinelloides 
Neurospora crassa . 
Paecilomyces lilacinus 
i 1 5 Paracoccidioides brasiliensis 
Penicillium marneffei 
Phlafaphora verrucosa 
Pichia anomaia 
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Table 4. Non-limitating xample of microbial species for which tuf and/or atpD and/or recA 
sequenc s are used in the pr sent inv ntion (c ntinued). 



Fungal species (continued) 



Piedraia hortai 
Podospora anserina 
Puccinia graminis 
Pseudallescheria boydii 

1 0 Rhlzomucor racemosus 
Rhizopus oryzae 
Rhodotorula minuta 
Rhodotorula mucilaginosa 
Saccharomyces cerevisiae 

15 Saksenaea vasiformis 

Schizosaccharomyces pombe 
Scoputariopsis koningii 
SporobolomycBS salmonicolor 
Sporothrix schenckii 

20 Stephanoascus ciferrii 

Syncephalastrum racemosum 
Trichodefma reesel 
Trichophyton mentagrophytes 
Trichophyton tonsurans 

25 Thchosporon cutaneum 
Ustiiago maydis 
Wangiella dermatitidis 
Yarrowia iipofytica 
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Trypanosoma brucei subsp. brucei 
Trypanosoma congolense 
Trypanosoma cruzi 



30 



Parasitical species 



Babesia bigemina 

Babesia bovis 

Babesia microti 
35 Blastocysts hominis 

Crithidia fascicuiata 

Cryptosporidium parvum 

Entamoeba histolytica 

Giardia lambiia 
40 Kentrophoros sp. 

Leishmania aethiopica 

Leishmania amazonensis 

Leishmania braziiiensis 

Leishmania donovani 
45 Leishmania infantum 

Leishmania enriettii 

Leishmania gerbiiii 

Leishmania guyanensis 

Leishmania hertigi 
50 Leishmania major 

Leishmania mexicana 

Leishmania tarentofae 

Leishmania tropica 

Neospora caninum 
55 Onchocerca volvuius 

Plasmodium berghei 

Plasmodium falciparum 

Plasmodium knowlesi 

Porphyra purpurea 
60 Toxoplasma gondii 

Treponema pallidum 

Trichomonas vaginalis 
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Tables. Antibiotic resistanc genes 

Genes Antibiotics 



selected for diagnostic purposes. 

1 ACCESSION NO. 



aac(3)-lb 2 

aac(3)-llb 2 

10 aac(3)-IVa 2 
aac(3)-Vla 2 

aac(2)-1a 2 

15 aacffl-aphP")' 



20 



25 



30 



aac(6>/a, d 
aac(6-)-lc 2 
aac(6')-Ua 7 



40 



45 



Aminoglycosides 

Aminoglycosides 

Aminoglycosides 
Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

VadB [ant(2"Ha 2 ] Aminoglycosides 

aacC1 [aac(3Ha 2 \ Aminoglycosides 

aacC2 [aac(3Hla 2 ) Aminoglycosides 

aacC3 [aac(3)-W 2 ] Aminoglycosides 

aacAA [aacM'lb 2 ] Aminoglycosides 

ant(3 a )-la 2 Aminoglycosides 

" T T^inoglycbsides 
Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Aminoglycosides 

Streptomycin 

Streptomycin 



ant(4>la 2 
aph(3')-la 2 

aph(3')-Ha 2 



35 aph(31-ttia 2 
aph(3*)-Vla 2 
2 



rrs 



rpsL* 



blaoXA 



5,6 



biaROB° 
50 blasHV 5 ' 6 
blajEM 53 

55 blacARB 5 



B-lactams 
B-lactams 
B-lactams 
B-lactams 

B-lactams 



Bacteria' 



Enterobacteriaceae 
Pseudomonads 
Enterobacteriaceae, 
Pseudomonads 
En terobacteriaceae 
En terobacteriaceae, 
Pseudomonads 
Enterobacteriaceae, 
Pseudomonads 

Enterococcus sp., 

Staphylococcus sp. 

Enterobacteriaceae, 

Pseudomonads 

Enterobacteriaceae, 

Pseudomonads 

Pseudomonads 

Enterobacteriaceae 

Pseudomonads 

Pseudomonads 

Pseudomonads 

Pseudomonads 

Enterobacteriaceae, 

Enterococcus sp., 

Staphylo coccus sp. 



Staphylococcus sp . 

Enterobacteriaceae, 

Pseudomonads 

Enterobacteriaceae, 

Pseudomonads 

Enterococcus sp.. 

Staphylococcus sp. 

Enterobacteriaceae, 

Pseudomonads 

M tuberculosis 

M. tuberculosis, 
M. avium complex 



Enterobacteriaceae, 
Pseudomonads 
Haemophilus sp. 
Pasteurella sp. 
Enterobacteriacea, 
Pseudomonas aeruginosa 
Enterobacteriaceae, 

Neisseria sp., 
Haemophilus sp. 

Pseudomodas sp., 

Enterobacteriaceae 



L06157 

M97172 

X01385 
M88012 

X04555 



M18967 
M94066 



X02340 
M10241 

V01282 — 
J01839 

V006\8 

V01547 

X07753 

L15307 
S62531 
X80120 
U 14749 
X70995 
L0801 1 



SEQ ID NO. 
(genes) 



83-86 3 



112 4 
53-54 3 
55-56 3 
57-58 3 
59-60 3 
65-66 3 



110 4 
45-48 3 
41 -44 3 
37-40 3 



J05162 
S46063 
M69058 



60 
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Antibiotic resistance genes 



selected for diagnostic purposes (continued). 




10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



MacTX-M-1 * 
M*CTX-M-2 l 
M*CMY-2 7 
blapER-1 5 

MapER-2 7 
IMP* 



blaZ 



12 



mecA 
penA 
pbpla 



12 
13 



13 



B-lactams 
B-lactams 
B-lactams 
B-lactams 

B-lactams 
B-lactams 

B-lactams 

B-lactams 
B-lactams 
B-lactams 



Enterobacteriaceae 

Enterobacteriaceae 

Enterobacteriaceae 

Enterobacteriaceae, 

Pseudomodanaceae 

Enterobacteriaceae 

Enterobacteriaceae, 
Pseudomonas aeruginosa 

Enterococcus sp., 
Staphylococcus sp. 
Staphylococcus sp. 
Neisseria gonorrhoeae 
Streptococcus pneumoniae 



pbp2b 



13 



B-lactams 



Streptococcus pneumoniae 



X92506 
X92507 
X91840 
Z21957 

X93314 
AJ223604 



X54021 

f/.SG527 
X67872 
AB006868 
AB006874 

X67873 
AB006878 
AB006875 
AB006877 
AB006879 
AF046237 
AF046235 
AF026431 
AF046232 
AF046233 
AF046236 
X67871 
Z49095 
AF046234 
AB006873 
X67866 
X67868 
AB006870 
AB006869 
AB006872 

X67870 
AB006871 
X67867 
X67869 
AB006876 
AF046230 
AF046238 
Z49094 

X16022 
M25516 
M25518 
M25515 
U20071 
U20084 
U20082 
U20067 
U20079 
Z22185 
U20072 



111 4 
97-98 3 
SEE TABLE 7 



SEE TABLE 7 
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Table 5. 



Antibiotic resistance genes sel cted for diagnostic purpos s (continued). 



Genes 



Antibiotics 



Bacteria 1 



ACCESSION NO. 



pbp2b 



13 



B-lactams 



Streptococcus pneumoniae 



10 



15 



20 



25 



30 pbp2x 



35 



13 



B-lactams 



Streptococcus pneumoniae 



40 



45 



50 



55 



int 
sul 



ermA 14 
60 erroB 14 



(3 -lactams, 
trimethoprim 

aminoglycosides, 

antiseptic, 

chloramphenicol 

Macrolides, 
lincosamides, 
streptogramin B 

Macrolides, 

lincosamides. 
streptogramin B 



Enterobacteriaceae, 
Pseudomonads 

Staphylococcus sp. 



Enterobacteriaceae, 
Staphylococcus sp. 
Enterococcus sp. 
Streptococcus sp. 



U20083 

U20081 

M25522 

U20075 

U20070 

U20077 

U20068 

Z22184 

U20069 

U20078 

M25521 

M25525 

M25519 

Z21981 

M25523 

M25526 

M25524 

222230 

U20073 

U20080 

U20074 

U20076 

M25520 

M25517 

X 16367 
X65135 
AB011204 
AB011209 
AB011199 
AB011200 
AB011201 
AB011202 
AB011198 
AB011208 
AB011205 
AB015852 
AB011210 
AB015849 
AB0 15850 
AB015851 
AB015847 
ABO 15846 
ABO 11207 
AB015848 
Z49096 



SEQ ID NO. 
(genes) 



SEE TABLE 7 



99-102 3 
103-106 3 

113 4 

114 4 
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Table 5. 



Genes 



Antibi tier sistance genes selected! r diagnostic purp ses (continued) 

— ^ ACCESSION NO. 



ermC 



14 



10 ere A 



12 



ereB 



12 



15 



20 



25 



30 



35 



40 



45 



50 



55 



msrA 



mtr* 



12 



mefA, mefE Q 

mphA B 

linAAlnA'* 

i/nB 10 
rma 11 



15 
15 



vga 
vgb 
vat™ 
vafB 15 

sa£A 15 



mupA 



12 



9yrA 



16 



parC/grtA™ 



Antibiotics 



Bacteria 1 



Macroltd s, 
lincosamides, 
streptogramin B 

Macrolides 

Macrolides 

Macrolides 

Macrolides 



Macrolides 

Macrolides 

Lincosamides 

Lincosamides 
Macrolides 

Streptrogramin 
Streptrogramin 
Streptrogramin 
Streptrogramin 

Streptrogramin 
Mupirocin 
Mupirocin 
Quinolones 



Quinolones 



Enterobacteriaceae, 
Staphylococcus sp. 



Enterobacteriaceae, 
Staphylococcus sp. 
Enterobacteriaceae 
Staphylococcus sp. 
Staphylococcus sp. 

Neisseria gonorrhoeae 



Streptococcus sp. 

Enterobacteriaceae, 
Staphylococcus sp. 
Staphylococcus sp. 

Enterococcus faecium 
Mycobacterium 
avium complex 
Staphylococcus sp. 
Staphylococcus sp. 
Staphylococcus sp. 
Staphylococcus sp. 

Enterococcus faecium 
Staphylococcus aureus 
Staphylococcus aureus 

Gram-positive and 
gram-negative bacteria 



Gram-positive and 
gram-negative bacteria 



M11277 
A15097 



S42418 
S40252 
S42417 
S40251 
Z25796 
U 14993 
G5-10G7 
Q51006 
Q51073 
AF037040 
AF037041 

U70055 
U83667 

D 16251 

J03947 
M 14039 
AF1 10130 
U74494 



M36022 

U19456 
L38809 

X74219 
X75439 
X9571B 
X06744 
X57174 
X16817 
X71437 
AF065152 
AF060881 
D32252 
AB005036 
AF056287 

X95717 
AF129764 
AB017811 
AF065152 



SEQ ID NO. 
(g nes) 



115' 



77-80 * 



89-90 3 
87-88 3 

81-82 3 
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Table 5. 

Gen s 



Antibiotic resistance g nes 



s lect d for dlagn stic purp ses (continued). 



10 



15 



20 



25 



30 



parE/grlB™ 
no/* 16 

mexR(nalB)^ 6 
nfxB™ 



cat 



12 



rpoB 



inhA 



17 



17 



35 katG 



17 



40 



45 



50 



ahpC 



17 



embB 
pncA 
vanA 



17 
17 
12 



vanB 



12 



vanC1 12 
55 vanC2 12 

60 



Antibiotics 

Quinolones 

Quinolones 

Quinolones 
Quinolones 
Chloramphenicol 



1 



Bacteria 
Gram-positive bacteria 
Staphylococcus sp. 



Pseudomonas aeruginosa 
Pseudomonas aeruginosa 
Gram-positive and 
gram-negative bacteria 



Rifampin 



Mycobacterium tuberculosis 



Isoniazid Mycobacterium tuberculosis 

Isoniazid Mycobacterium tuberculosis 



Isoniazid 



Ethambutol 

Pyrazinamide 

Vancomycin 

Vancomycin 
Vancomycin 

Vancomycin 



Mycobacterium tuberculosis 



Mycobacterium tuberculosis 
Mycobacterium tuberculosis 
Enterococcus sp. 

Enterococcus sp. 
Enterococcus gaflinarum 

Enterococcus casseliflavus 



ACCESSION NO. 



X95717 
AF065153 
AF058920 

D90119 
M80252 
M97169 

U23763 

X66646 

M55620 

X15100 

A24651 

M28717 

A00568 

A00569 

X74948 

Y00723 

A24362 

A00569 

M93113 

M62822 

M58516 

AF055891 
AF055892 
S71246 
L27989 
AF055893 

AF 106077 
U02492 

U40593 
U06259 
U06260 
U06261 
U 06262 
U40594 
U40595 

U43812 
U57761 
U24085 
U 16243 
U58030 
U18264 

U68480 

U59967 



U94521 
U 94522 
U94523 
U94524 
U94525 
L29638 



SEQ ID NO. 
(genes) 



67-70 3 

SEE TABLE 7 

116 4 

117 4 

SEE TABLE 7 
SEE TABLE 7 
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Table 5. Antibiotic resistance gen 



s selected for diagnostic purposes (continued). 



Genes 



vanC3^ 2 



vanD 



18 



19 
19 



tetB 
tetM 

sum 20 



Antibiotics 



Vancomycin 



Vancomycin 
Tetracycline 
Tetracycline 

Sulfonamides 



dhfria 20 


Trimethoprim 


dhfrlb 20 


Trimethoprim 


dhfrV 2 * 


Trimethoprim 


dhfrVII 20 


Trimethoprim 


dhfrVHI 20 


Trimethoprim 


dhfrlX 20 


Trimethoprim 


dhfrXH 20 


Trimethoprim 


dfrA 20 


Trimethoprim 



Bacteria 1 



Enterococcus flavescens 



Enterococcus faecium 
Gram-negative bacteria 

Gram-negative and 
Gram-positive bacteria 
Gram-negative bacteria 



Gram-negative bacteria 

Gram-negative bacteria 

Gram-negative bacteria 
Gram-negative bacteria 
Gram-negative bacteria 
Gram-negative bacteria 
Gram-negative bacteria 
Staphylococcus sp. 



ACCESSION NO. 



L29639 
U72706 
L29640 
AF1 30997 
J01830 
X52632 

D37827 
M36657 
AF017389 
AF017391 
AJ238350 
X17477 
Z50805 
250804 

X12868 
U31119 
U10186 
X57730 
Z21672 

AF045472 
U40259 

AF051916 



SEQ ID NO. 
(genes) 

SEE TABLE 7 



Bacteria having high incidence for the specified antibiotic resistance^ The presence ol the antibiotic 
resistance genes in -J^^^^^ Miller . 1993 . Molecu.ar genetics of aminoglycoside 

enZym6S - MiCr0bi ° L 

^Irtm genes from our co-pending US patent no. 6.001 ,564 for which we have selected 
PCR primer pairs. 

These SEQ ID NOs. refer to a previous patent (application WO98/201 57). 

lusn K G A. Jacoby and A. Medeiros. t995. A functional classification scheme for lactamase and rts 

Mrreiati or i with molecular structure. Antimicrob. Agents. Chemother. 39:1 21 1 -1 233. 

Nuclide lations in b.a S HV. **TEM, and b.a 0 XA. are associated with extended-spectrum B- 

tSSSX T, WSSZX KSTlSt Plasmid-encoded AmpC beta-lactamases: how far have we 
nnnft 1 o ears after discovery? Yonsei Med. J. 39:520-525. 

. rt 1891, « u c it-,. rt n a Yumk K A Farlev. B. J. Stockman, and R. Leciercq. 

W ^TrZ-~r^e;^Zl^t^Z & nutation ,„ ««*». 

faec/umHM1025. Antimicrob. Agents. Chemother. 43:925-929. 
11 CocZl HI. F.R. 1999. Genetic methods for assessing antimicrobial resistance. Ant.rn.crob. Agents. 
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binding-protein-mediated resistance to B-lactam antibiotics. Trends Molec. Microbiol.2: 361 -366. 

14 Jensen. L. B., N. Frimodt-Moller. F. M. Aarestrup^SSS. Presence of erm gene classes .n Gram-pos.t,ve 
bacteria of animal and human origin in Denmark. FEMS M.crob.ol. 170:151-158. 

15 Thai. L. A., and M. J. Zervos. 1999. Occurrence and epidemiology of resistance to v.rgimmycin and 
5 streDtrogramins. J. Antimicrob. Chemother. 43: 171 -176- 

16 Martinez J. L.. A. Alonso. J. M. Gomez-Gomez, and F. Baquero 1998. Q™°}°™ ^ance by mutafons 
in chromosomal gyrase genes. Just the tip of the iceberg? J. Ant.rn.crob. Chemother. 42.683-688 

17cockerill III. F.R. 1999. Genetic methods for assessing antimicrobial resistance. Anumicrob. Agents. 

Chemother.' 43:1 99-212. 

10 18 casadewall B. and P. Courvalin. 1999 Characterization of the vanD glycopept.de res.stance gene cluster 
from Enterococcus faecium BM 4339. J. Bacteriol. 181:3644-3648. 
19 Roberts. M.C. 1999. Genetic mobility and distribution of tetracycline resistance determ.nants.Ciba Found. 

Symp. 207:206-222. . . 

20 Hu ovinen. P., L. SundstrSm. G. Swedberg. and O. Skold. 1995. Trimethopnm and sulfonam.de 
1 5 resistance.Antimicrob. Agent Chemother. 39:279-289. 
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Table 6. List of bacterial toxins selected for diagn stic purposes. 



Organism 



Toxin 



Accession number 



Actinobacillusactinomycetemcomitans 

Actinomyces pyogenes 
Aeromonas hydrophila 

Bacillus anthrads 
Bacillus cereus 



Bacillus mycoides 
Bacillus pseudomycoides 
Bacterotdes fragilis 



Bordetella bronchiseptica 



Bordetella pertussis 



Campylobacter jejuni 
Citrobacter freundii 
Clostridium botulinum 



Cytolethal distending toxin (cotA, cdS, cdtC) 
Leukotoxin (rtxA) 

Hemolysin (pyolysin) 

Aerolysin (aerA) 
Haemolysin (hlyA) 
Cytotonic enterotoxin (a/r) 

Anthrax toxin (cya) 

Enterotoxin (bceJ) 

Enterotoxic hemolysin BL 

Non-haemolytta enterotoxins A,B and C {nne) 

Hemoiytic enterotoxin HBL 

Hemolytic enterotoxin HBL 

Enterotoxin (6ftP) , 
Matrix m^rioWotQas^n^rotoxin tfr&giiywrij 
Metatloprotease toxin-2 

Metalloprotease toxin-3 

Adenylate cyclase hemolysin (cyaA) 
Dermonecrotic toxin (dnr) 

Pertussis toxin (S1 subunit, /ox) 

Patente; 

EP0322533-A 2 05iulB9 
EP0322l15-A5 28|un89 
EP0396964-A 1 14nov90 
JP1987228286-A 1 7oct87 



Adenyl cyclase (cya) 
Dermonecrotic toxin (dnf) 

Cytolethal distending toxin (cotA, cd*B, cdtC) 

Shlga-like toxin (s/f-IIcA) 

Botulism toxin (BoNT) 

The A.B.E and F serotypes are 
neurotoxic for humans 

The other serotypes have not been considered 

Partial sequences (<200 bp) have 
not been considered 



Clostridium difficile 



A toxin (enterotoxin) {tcdA) 



AF006830 
M27399 

U84782 

M16495 
U81555 
L77573 

M23179 

D17312 

AF1 92766, AF1 92767 

AJ237785 

Y19005 

AJ 243 150 to AJ2431 53 
AJ243154to AJ243156 
U67735 

S7534-; . AFC3S453 
U90931 
AF081785 
AF056297 

Z37112, U 22953 

U59687 

AB020025 

AJ 006151 

AJ0061S3 

AJ006155 

AJ008157 

AJ006159 

AJ007363 

M14378, M16494 

AJ 007364 

M13223 

X16347 

18323 

U10527 

U51121 

X67514, S53206 

X52066, X52088 

X73423 

M30196 

X70814 

X70819 

X71343 

Z11934 

X70817 

MB1186 

X70818 

X70815 

X62089 

X62683 

S76749 

X81714 

X70816 

X70820 

X70281 

L35496 

M 92 906 

ABO 12304 

AF053400 

Y12616 

X51797 

X17194 

M30307 
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Table 6. List f bacterial toxins selected for diagnostic purposes (continued). 



Organism 



Toxin 



Accession number 



Clostridium difficile 
Clostridium perfringens 



Clostridium sordellii 
Clostridium tetani 

Corynebacterium diphtheriae 

Corynebacterium pseudotuberculosis 

Eikenella corrodens 
Enterobacter cloacae 
Enterococcus faecaiis 
Escherichia colt (EHEC) 



B toxin (cytotoxin) {toxB) 
Alpha (phospholipase C) (cpa) 



Beta (deimonecrottc protein) (cpb) 
Enterotoxln (cpe) 

Enterotoxin pseudogene (not expressed) 

Epsilon toxin (etxD) 

lota (la and lb) 

Lambda (metalioprotease) 

Theta (perfringolysin O) 

Cytotoxin L 

Tetanos toxin 

Diphtheriae toxin 
Patenl : 

JP 1985227681 -A/1 

Phospholipase C 
Patent : WO 901 1 351 -A 2 

lysine decarboxylase (cadA) 

Shiga-like toxin II 

Cytolysin B (cyfB) 

Hemoiysln toxin (n/yA and ehxA) 



Shiga-like (Vera cytotoxin) {six) 

Contains the sequences for both the 
A and B subunits 
Patent JP 1995008280-A/1 



Z23277 
X53138 

L43545 

L43546 

L43547 

L43548 

X13608 

X17300 

D10248 

L13198 

X83275 

L77965 

AJ00O766 

M98037 

X81849 

X71844 

Y16009 

AF037328 

AF037329 

AF037330 

M80837 

M95206 

X60694 

X73562 

D45904 

M36704 

X62638 

X06214 

X04436 

X00703 



A21336 



U89166 

Z50754, U33502 

M38052 

AF043471 

X94129 

X79839 

X86087 

AB011549 

AF074613 

X81418 

M14107 

M10133 

M12863 

X81417 

X81416 

X81415 

Z36900 

L11078 

L04539 

L11079 

X65949 

M21534 

M29153 

Z37725 

Z36901 

X61283 

AB017524 

U72191 
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Table 6. 



List of bacterial t xins s lected lor diagnostic purposes (continued). 



Organism 



Toxin 



Escherichia colt (ETEC) 



Escherichia coii {other) 



Haemophilus ducreyi 
Helicobacter pylori 



Legionella pneumophila 
Listeria monocytogenes 



Pasteurella multocida 

Proteus mirabilis 
Pseudomonas aemginosa 
Salmonella typhimurium 

Sermtia marcescens 
Shigella dysenteriae type 1 

Shigella flexneri 



Shigella sonnel 
Sphingomonas paucimobilis 
Staphylococcus aureus 



Enterotoxin (heat-labile) {elB) 
Patents : EP0145486-A5 
WO 9313202-A42 
JP1986005097-A 
JP 1992320675- A 

Enterotoxin (heat-stable) (astA) (estA1) 



Cytolethal-distending toxin 
(cdt) (3 genes) 

Cytotoxic necrotizing factor 1 {cnf\ ) 
Microcln 24 {mtfS) 

Autotransporter enterotoxin {Pet) (cytotoxin) 
Cytoiethal distending toxin (cdiA, ccftB, ccrtC) 
Vacuolating fordr. (vscA) 



Structural toxin protein (rfxA) 
Usteriolysln O («sA, hlyA) 



Mitogenlc toxin (dermonecrotic toxin) 

Hemolysin (hpmA) 

Cytotoxin (Enterotoxin A) 

Calmodulin-sensltive adenylate cyalase toxin 
Cytolysln (salmolysin) {styA) 
Enterotoxin {stn) 

Hemolysin {shIA) 

Shiga toxin (2 subunits) 
(stxA and stxB) 

ShET2 enterotoxin (senA) 

Enterotoxin 1 (aefl A and seflB) 
Hemolysin E {htyE. ctyK sheA) 

Shiga toxin (2 subunits) (stxA and stxB) 

Beta-hemolysin (hlyA) 

Gamma-hemolysln {hlg2) 

Enterotoxin 
Enterotoxin A (sea) 

Enterotoxin B 
Enterotoxin C1 (emC1) 
Enterotoxin C2 {entCZ) 
Enterotoxin C3 (en<C3) 
Enterotoxin D {seo) 
Enterotoxin E 



M17874 
M17873 
J01605 
AB011677 

L11241 

M58746 

M29255 

V0O612 

J01831 

U03293 
U04208 
U69305 
U42629 
U47048 

AF056581 

U53215 

U07145 

U80067 

U 80068 

AF077938 

AF077939 

AF077940 

AF077941 

AF057703 

X15127 
M24199 
X60035 
U25452 
U25443 
U25446 
U25449 

X57775, Z28388 

X51512 

X52478 

M30186 

X14956 

(cya) AF060869 
U03842 
L16014 

M22618 

X07903, M3251 1 
M19437 

M24352, M21947 

254211 

Z47381 

U35656 

AF200955 

AJ 132761 

L01270 

0421 43 
L01055 
U93688 

L22565, L22566 

M18970 

M11118 

X05815 

P34071 

X51661 

M 94672 

M21319 



80 



CA 02307010 2000-05-19 



Table 6. 



List of bacterial toxins selected for diagn stlc purposes (continued). 



Organism 



Toxin 



Staphylococcus aureus 



Staphylococcus epidermidis 
Staphylococcus intermedius 



Streptococcus pneumoniae 
Streptococcus pyogenes 



Vibrio choterae 



Vibrio parahaemotyticus 
Vibrio vulnificus 
Yersinia enterocolitica 

Yersinia kristensenii 
Yersinia pestis 



Enterotoxin G {seg) 
Enterotoxin H {sen) 
Enterotoxin I (sell 
Enterotoxin J 

Exfoliative toxin A (ETA) (Epidermolytic toxin A) 
Extoliative toxin B (ETB) 

Leukocidin R (F and S component, lukF and lukS) 
(Hemolysin B and C) 

Toxic shock syndrome toxin 1 (TSST-1) 
(alpha toxin) 
(alpha hemolysin) 



AF064773 

U11702 

AF064774 

AF053140 

M17347 

M17357 

L25372, M20371 

M17348, M13775 

X64389, S53213 

X72700 

L01055 

X01645 

M90536 

J02615 

U 93688 

AF068634 



Delta toxin (hJa) 

Enterotoxin 1 ^ t _ iJ?«?« 

Leukocidin R (F and S component, iukf and lukS) X791 88 
(synergohymenotroplc toxin) 



Pneumolysin 

Streptococcus pyrogenic exotoxin A (speA) 

Pyrogenic exotoxin B (speB) 

Cholerae toxin (ctxA and clxB subunits) 
Patents : 

JP 1995008278-A 1 
EP 0368819-A12(CtxB) 
WO 9313202-A 45 (ctxA) 



Accessory cholera enterotoxin (ace) 
Heat-stable enterotoxin (sto) 

Zonula occludens toxin (zof) 

Thermostable direct hemolysin (tdh) 

Cytolysin (whA) 

Heat-stable enterotoxin (ysf) 
Heat-stable enterotoxin type B (ysfB) 
Heat-stable enterotoxin type C (ystC) 

Enterotoxin 

Toxin 



X52474 

X61560 

(and 1 9 others) 

X03929 

U40453, M19350 
U63134 

M86905, M35110 

X00171 
X76390 
X58786 

X5B785, S55782 

D30052 

D30053 

K02679 

AF175708 

Z22569, AF175708 

X74108, M85198 

M97591.L03220 

M83563, AF 175708 

S67841 

M34670 

U09235, X65999 

D88145 

D63578 

X69218 

X92727 
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Table 7. Origin of the sequences in the sequence listing. 



SEQ ID NO. 



Bacterial.lungal or parasitical species 



Source 



Comments* 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



1 Acinetobacter baumannii 

2 Actinomyces meyeri 

3 Aerococcus viridans 

4 Achromobacter xylosoxidans subsp denitnficans 

5 Anaerorhabdus furcosus 

6 Bacillus anthracis 

7 Bacillus cereus 

8 Bacteroides distasonis 

g Enterococcus casseliflavus 

1 o Staphylococcus saprophytics 

1 -j Bacteroides ovatus 

12 Bartonella henselae 

1 3 Bifidobacterium adolescentis 

1 4 Bifidobacterium dentium 

15 Brucella abortus 

1 6 Burkholderia cepacia 

17 Cedecea davisae 
13 Gsdecsa r.eteri 

1 9 Cedecea lapagei 

20 Chlamydia pneumoniae 

2 1 Chlamydia psittaci 

22 Chlamydia trachomatis 

23 Chryseobacterium meningosepticum 

24 Citrobacter amalonaticus 

25 Citrobacter braakii 

26 Citrobacter koseri 

27 Citrobacter farmeri 

28 Citrobacter freundii 

29 Citrobacter sedlakii 

30 Citrobacter werkmanii _; _ 

31 Citrobacter youngae 

32 Clostridium perfringens 

33 Comamonas acidovorans 

34 Corynebacterium bovis 

35 Corynebacterium cervicis 
35 Corynebacterium flavescens 
37 Corynebacterium kutscheri 

33 Corynebacterium minutissimum 

3g Corynebacterium mycetoides 

40 Corynebacterium pseudogenitalium 

41 Corynebacterium renale 

42 Corynebacterium ulcerans 

43 Corynebacterium urealyticum 

44 Corynebacterium xerosis 

45 Coxiella burnetii 

46 Edwardsiella hoshinae 

47 Edwardsiella tarda 

48 Eikenella corrodens 

4g Enterobacter aerogenes 

50 Enterobacter agglomerans 

51 Enterobacter amnigenus 

52 Enterobacter asburiae 

53 Enterobacter cancerogenus 

54 Enterobacter cloacae 

55 Enterobacter gergoviae 

56 Enterobacter hormaechei 

57 enterobacter sakazakii 

58 Enterococcus casseliflavus 

59 Enterococcus cecorum 

60 Enterococcus dispar 

61 Enterococcus durans 



This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 
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Table 7. Origin of the sequences in the s quence listing (continued). 



SEQ ID NO. 



5 62 
63 
64 
65 
66 

10 67 
68 
69 
70 
71 

15 72 
73 
74 
75 
76 

20 77 
78 
79 
80 
81 

25 82 
83 
84 
85 
86 

30 87 
88 
89 
90 
91 

35 92 
93 
94 
95 
96 

40 97 
98 
99 
100 
101 

45 102 
103 
104 
105 
106 

50 107 
108 
109 
110 
111 

55 112 
113 
114 
115 
116 

60 117 
118 
119 
120 
121 

65 122 



Bact riaUungal or parasitical species 



Source 



Comments* 



Enterococcus faecalis 
Enterococcus faecalis 
Enterococcus faecium 
Enterococcus flavescens 
Enterococcus gallinarum 
Enterococcus hirae 
Enterococcus mundtii 
Enterococcus pseudoavium 
Enterococcus raffinosus 
Enterococcus saccharolyticus 
Enterococcus soiitarius 
Enterococcus casseliflavus 
Staphylococcus saprophytics 
Enterococcus fiavescens 
Enterococcus gallinarum 
Ehrlichia cants 
Escherichia coii 
Escherichia fergusonii 
Escherichia hermannii 
Escherichia vulneris 
Eubacterium lentum 
Eubacterium nodatum 
Ewingella americana 
Francisella tularensis 

Fusobacterium nucleatum subsp. potymorphum 

Gemella haemolysans 

Gemetla morbillorum 

Haemophilus actinomycetemcomitans 

Haemophilus aphrophilus 

Haemophilus ducreyi 

Haemophilus haemolyticus 

Haemophilus parahaemolyticus 

Haemophilus parainfluenzae 

Haemophilus paraphrophilus 

Haemophilus segnis 

Hafnia alvei 

Kingella kingae 

Klebsiella omithinolytica 

Klebsiella oxytoca 

Klebsiella planticola 

Klebsiella pneumoniae subsp. ozaenae 

Klebsiella pneumoniae pneumoniae 

Klebsiella pneumoniae subsp. rhinosderomatis 

Kluyvera ascorbata 

Kluyvera cryocrescens 

Kluyvera georgiana 

Lactobacillus casei subsp. casei 

Lactococcus ladis subsp. lactis 

Leclerda adecarboxylata 

Legionella micdadel 

Legionella pneumophila subsp. pneumophila 

Lemlnorella grimontii 

Leminoreila richardii 

Leptospira interrogans 

Megamonas hypermegale 

Mitsuokella multacidus 

Mobiluncus curtisii subsp. holmesii 

Moellerella wisconsensis 

Moraxella catarrhalis 

Morganella morganii subsp. morganii 

Mycobacterium tuberculosis 



This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 

tuf{C) 

unknown 

tuf{0) 

tuf{C) 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 
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Table 7. 



Origin of th sequences In the sequ nee listing (continued). 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



SEQ ID NO. 



123 

124 

125 

126 

127 

128 

129 

130 

131 

132 

133 

134 

135 

136 

137 

138 

139 

i -» v 

141 

142 

143 

144 

145 

146 

147 

148 

149 

150 

151 

152 

153 

154 

155 

156 

157 

158 

159 

160 
161 

162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 



Bacterial.fungal or parasitical species 



Sourc 



Neisseria cinerea 

Neisseria elongate subsp. elongata 
Neisseria flavescens 
Neisseria gonorrhoeae 
Neisseria lactamica 
Neisseria meningitidis 
Neisseria mucosa 
Neisseria sicca 
Neisseria subflava 
Neisseria weave ri 
Ochrobactrum anthropi 
Pantoea agglomerans 
Pantoea dispersa 
Pasteureita muttocida 
Peptostreptococcus anaerobius 
Peptostreptococcus asaccharoiyticus 
Peptostreptococcus prevotii 
PorphytornoDas asaccharolytica 

Porphyromonas gingivals 

Pragia fontium 

Prevotetla melaninogenica 

Prevoteila oralis 

Propionibacterium acnes 

Proteus mirabilis 

Proteus penned 

Proteus vulgaris 

Providenda alcaiifaciens 

Providencia rettgeri 

Providenda rustigianii 

Providenda stuartii 

Pseudomonas aeruginosa 

Pseudomonas fluorescens 

Pseudomonas stutzeri 

Psychrobacter phenyfpyruvicum 

Rahnella aquatilis 

Salmonella choleraesuis subsp.anzonae 
Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 
Salmonella choleraesuis subsp. diarizonae 
Salmonella choleraesuis subsp. choleraesuis 
serotype Heidelberg 

Salmonella choleraesuis subsp. houtenae 

Salmonella choleraesuis subsp. Indica 

Salmonella choleraesuis subsp. salamae 

Salmonella choleraesuis subsp. choleraesuis serotype Typhi 

Serratia fonticola 

Serratia liquefaciens 

Serratia marcescens 

Serratia odorifera 

Serratia plymuthica 

Serratia rubidaea 

Shigella boydii 

Shigella dysenteriae 

Shigella flexneri 

Shigella sonnel 

Staphylococcus aureus 

Staphylococcus aureus 

Staphylococcus aureus 

Staphylococcus aureus 

Staphylococcus aureus subsp. aureus 

Staphylococcus auricularis 

Staphylococcus capitis subsp. capitis 



This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent- 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



This patent 
This patent 

This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



Comments* 



tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
tuf 
iui 
tuf 
tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 
tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 
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Table 7. Origin of the sequences in the sequ nee listing (continued). 



SEQ ID NO. 



Bacterial.fungal or parasitical species 



Sourc 



Comments* 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



1 83 Macrococcus caseolyticus 

1 84 Staphylococcus cohnii 

1 85 Staphylococcus epidermidis 

1 86 Staphylococcus haemolyticus 

1 87 Staphylococcus warn eh 

1 88 Staphylococcus haemolyticus 

189 Staphylococcus haemolyticus 

1 90 Staphylococcus haemolyticus 

-j 91 Staphylococcus hominis subsp. hominis 

1 92 Staphylococcus wameri 

1 g3 Staphylococcus hominis 

1 94 Staphylococcus hominis 

1 95 Staphylococcus hominis 

1 96 Staphylococcus hominis 

1 97 Staphylococcus lugdunensis 

198 Staphylococcus saprophyticus 

1 99 Staphylococcus saprophyticus 

200 Staphylococcus saprophyticus 

201 Staphylococcus sciuri subsp. sciuri 

202 Staphylococcus warned 

203 Staphylococcus wameri 

204 Bifidobacterium longum 

205 Stenotrophomonas maltophilia 

206 Streptococcus acidominimus 

207 Streptococcus agalactiae 

208 Streptococcus agalactiae 

209 Streptococcus agalactiae 

21 0 Streptococcus agalactiae 

2 1 1 Streptococcus anginosus 

212 Streptococcus bovis 

213 Streptococcus anginosus 

214 Streptococcus cricetus 

2 1 5 Streptococcus cristatus 

2 1 6 Streptococcus downei 

2 1 7 Streptococcus dysgalactiae 

21 8 Streptococcus equi subsp. equ/ 

21 9 Streptococcus ferus 

220 Streptococcus gordonii 

22 1 Streptococcus anginosus 

222 Streptococcus macacae 

223 Streptococcus gordonii 

224 Streptococcus mutans 

225 Streptococcus parasanguinis 

226 Streptococcus ratti 

227 Streptococcus sanguinis 

228 Streptococcus sobrinus 

229 Streptococcus suis 

230 Streptococcus uteris 

231 Streptococcus vestibularis 

232 Tatumella ptyseos 

233 Trabulsiella guamensis 

234 Veillonella parvula 

235 Yersinia enterocoiitica 

236 Yersinia frederiksenii 

237 Yersinia intennedia 

238 Yersinia pestis 

239 Yersinia pseudotuberculosis 

240 Yersinia rohdei 

241 Yokenella regensburgei 

242 Achromobacter xylosoxidans subsp. denitrificans 

243 Acinetobacter baumannii 

244 Acinetobacter Iwoffii 



This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
. This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tut 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

atpD 

atpD 

atpD 



85 



CA 02307010 2000-05-19 



Table 7. Origin of the sequences in the sequence listing (continued). 



SEQ ID NO. 



BacteriaUungal or parasitical species 



Sourc 



Comments* 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



245 Staphylococcus saprophyticus 

246 Alcaligenes faecalis 

247 Bacillus anthracis 

248 Bacillus cereus 

249 Bacteroides distasonis 

250 Bacteroides ovatus 

251 Ledercia adecarboxylata 

252 Stenotrophomonas maltophilia 

253 Bartonella henselae 

254 Bifidobacterium adolescentis 

255 Brucella abortus 

256 Cedecea davisae 

257 Cedecea lapagei 

258 Cedecea neteri 

259 Chryseobacterium meningosepticum 

260 Citrobacter amalonaticus 

261 Citrobacter braakii 
2«2 Citrobacter koseri 

263 Citrobacter farmeri 

264 Citrobacter freundii 

265 Citrobacter koseri 

266 Citrobacter sedlakii 

267 Citrobacter werkmanii 

268 Citrobacter youngae 

269 Clostridium innocuum 

270 Clostridium perfringens 

272 Corynebacterium diphtheriae 

273 Corynebacterium pseudodiphtheriticum 

274 Corynebacterium ulcerans 

275 ~~ Corynebacterium urealyticum 

276 Coxielia burnetii 

277 Edwardsiella hoshinae 

278 Edwardsiella tarda 

279 Eikenella corrodens 

2 80 Enterobacter agglomerans 

28 1 Enterobacter amnlgenus 

282 Enterobacter asburiae 

283 Enterobacter cancerogenus 

284 Enterobacter cloacae 

285 Enterobacter gergovlae 

286 Enterobacter hormaechei 

287 Enterobacter sakazakii 

288 Enterococcus avium 

289 Enterococcus cassetiflavus 

290 Enterococcus durans 

291 Enterococcus faecalis 

292 Enterococcus faecium 

293 Enterococcus gallinarum 

294 Enterococcus saccharoiyticus 

295 Escherichia fergusonii 

296 Escherichia hermannil 

297 Escherichia vulneris 

298 Eubacterium lentum 

299 Ewingella americana 

300 Francisella tularensis 

301 Fusobacterium gonidiaformans 

302 Fusobacterium necrophorum subsp. necrophorum 

303 Fusobacterium nucleatum subsp. polymorphum 

304 Gardnerella vaginalis 

305 Gemella haemolysans 

306 Gemella morbillorum 



This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
-This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 
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Table 7. Origin of the sequences in the sequence listing (continued). 



SEQ ID NO. 



Bacterial.f ungal or parasitical sp cies 



Source 



Comments* 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



307 
308 
309 

310 

311 

312 

313 

314 

315 

316 

317 

318 

319 

320 

321 

322 

323 

324 

325 

326 

327 

328 

329 

330 

331 
332 
333 
334 
335 

-336 
337 
338 
339 
340 
341 
342 
343 

344 
345 
346 
347 

348 

349 
350 
351 

352 

353 
354 
355 
356 
357 
358 
359 
360 
361 
362 
363 



Haemophilus ducreyi 
Haemophilus haemotyticus 
Haemophilus parahaemolyticus 
Haemophilus parainfluenzae 
Hafnia alvei 
Klngella kingae 

Klebsiella pneumoniae subsp. ozaenae 
Klebsiella omithinolytica 
Klebsiella oxytoca 
Klebsiella planticola 

Klebsiella pneumoniae subsp. pneumoniae 
Kluyvera ascorbata 
Kluyvera cryocrescens 
Kluyvera georgiana 
Lactobacillus acidophilus 
Legionella pneumophila subsp. pneumophila 
Leminorella grimontii 
Listeria monocytogenes 
Micrococcus lylae 
Moellerella wisconsensis 
Moraxella catarrhalis 
Moraxella osloensis 
Morganella morganii subsp. morganii 
Pantoea agglomerans 
Pantoea dispersa 
Pasteurella multocida 
Pragia fontium 
Proteus mirabllis 
^Proteus vulgaris 
Provldencia alcalifaciens 
Providencia rettgeri 
Providencia rustigianii 
Providencia stuartii 
Psychrobacter phenytpyruvicus 
Rahnella aquatilis 

Salmonella choleraesuls subsp. arizonae 
Salmonella choleraesuls subsp. choleraesuis 
serotype Choleraesuis 
Salmonella choleraesuis subsp. diarizonae 
Salmonella choleraesuis subsp. houtenae 
Salmonella choleraesuis subsp. indica 
Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi B 
Salmonella choleraesuls subsp. salamae 
Salmonella choleraesuis subsp. choleraesuis serotype Typh 
Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

Salmonella choleraesuis subsp. choleraesuis 
serotype Virchow 
Serratia ficaria 
Serratia fonticola 
Serratia grimesii 
Serratia liquefadens 
Serratia marcescens 
Serratia odorifera 
Serratia plymuthica 
Serratia rubidaea 
Pseudomonas putida 
Shigella boydii 
Shigella dysenteriae 
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This patent 

This patent 
This patent 
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This patent 
This patent 
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This patent 
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This patent 
This patent 
This patent 
This patent 
This patent 
This pat nt 
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atpD 
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atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 



atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 

atpD 
atpD 
atpD 
atpD 

atpD 

apD 
atpD 
atpD 

atpD 

atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
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Tabl 7. 



Origin of the sequences In the sequence listing (continued). 



SEQ ID NO. 



Bacterial.fungal or parasitical species 



Source 



Comments* 



5 364 Shigella flexn ri 

365 Shigella sonn i 

366 Staphylococcus aureus 

367 Staphylococcus auricularis 

368 Staphylococcus capitis subsp. capitis 
10 369 Staphylococcus cohnii 

370 Staphylococcus epidermidis 

371 staphylococcus haemolyticus 

372 Staphylococcus hominis subsp. hominis 

373 Staphylococcus hominis 

15 374 Staphylococcus lugdunensis 

375 Staphylococcus saprophytics 

376 Staphylococcus simulans 

377 Staphylococcus wameri 

378 Streptococcus acidominimus 
20 379 Streptococcus agalactiae 

380 Streptococcus agalactiae 

381 StrepiGccccusagalsctiae 

382 Streptococcus agalactiae 

383 Streptococcus agalactiae 
25 384 Streptococcus dysgalactiae 

385 Streptococcus equi subsp . egu/ 

386 Streptococcus anginosus 

387 Streptococcus salivarius 

388 Streptococcus suis 
30 389 Streptococcus uberis 

390 Tatumella ptyseos 

391 Trabulsiella guamensis 

392 Yersinia bercovieri 

393 "Yersinia enterocolitica 
35 394 Yersinia frederiksenii 

395 Yersinia intermedia 

396 Yersinia pseudotuberculosis 

397 Yersinia rohdei 

398 Yokenefla regensburgei 
40 399 Varnoiv/a iipolytica 

400 Absidia corymbifera 

401 Alternaria altemata 

402 Aspergillus flavus 

403 Aspergillus fumigatus 
45 404 Aspergillus fumigatus 

405 Aspergillus niger 

406 Blastoschizomyces capitatus 

407 Candida albicans 

408 Candida albicans 
50 409 Candida albicans 

410 Candida albicans 

4 1 1 Candida albicans 

412 Candida dubliniensis 

41 3 Candida catenulata 
55 414 Candida dubliniensis 

41 5 Candida dubliniensis 

416 Candida famata 

41 7 Candida glabrata 

418 Candida guilliermondii 
60 419 Candida haemulonii 

420 Candida inconspicua 

421 Candida kefyr 

422 Candida krusei 

423 Candida lambica 
65 424 Candida lusitaniae 

425 Candida norvegensis 
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This patent 
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This patent 
This patent 
This patent 
This patent 
This patent 
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This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
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atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

fuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 
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Table 7. Origin of the s quences In the sequence listing (continued). 



SEQ ID NO. 



Bacterial .fungal or parasitical species 



Source 



Comments* 



5 426 Candida parapsilosis 

427 Candida njgosa 

428 Candida sphaerica 

429 Candida tropicatis 

430 Candida utitis 

10 431 Candida viswanathii 

432 Candida zeylanoides 

433 Coccidioides immitis 

434 Cryptococcus albidus 

435 Exophiala jeanselmei 
1 5 43s Fusarium oxyspomm 

437 Geotrichum sp. 

438 Histoplasma capsulatum 

439 issatchenkia orientalis kudrjanzev 

440 Malassezia furfur 

20 441 Malassezia pachydermatis 

442 Matbranchea filamentosa 

443 Metschnikowia pulchenima 

444 Paecilomyces iilacinus 

445 Paracoccidioides brasiliensis 
25 446 Penicillium mameffei 

447 P/c/i/a anomala 

448 P/c/i/a anomala 

449 Pseudallescheria boydii 

450 Rhizopus oryzae 
30 451 Rhodotorula minuta 

452 Sporobolomyces salmonicolor 

453 Sporothrix schenckii 

454 Stephanoascus cifenii 

455 Trichophyton mentagrophytes 
35 456 Trichosporon cutaneum 

457 Wangieila dermatitidis 

458 Aspergillus fumigatus 

459 Blastoschizomyces capitatus 

460 Candida albicans 
40 461 Candida dubliniensis 

462 Candida famata 

463 Candida glabrata 

464 Candida guilliermondii 

465 Candida haemulonii 
45 466 Candida Inconspicua 

467 Candida kefyr 

468 Candida krusei 

469 Candida tambica 

470 Candida lusitaniae 
50 471 Candida norvegensis 

472 Candida parapsilosis 

473 Candida rugosa 

474 Candida sphaerica 

475 Candida tropicalis 
55 476 Candida utilis 

477 Candida viswanathii 

478 Candida zeylanoides 

479 Coccidioides immitis 

480 Cryptococcus albidus 
60 481 Fusarium oxysporum 

482 Geotrichum sp. 

483 Histoplasma capsulatum 

484 Malassezia furfur 

485 Malassezia pachydermatis 
65 486 Metschnikowia pulcherrima 

487 Penicillium mameffei 
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fuf(EF-1) 
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tuf (EF-1) 

fuf(EF-l) 

fuf(EF-1) 

fuf(EF-1) 

ft/f(EF-1) 

tuf (EF-1) 

fuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

fuf (EF-1) 

tuf (EF-1) 

fuf (EF-1) 

fuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 

fuf (EF-1) 

fur (EF-1) 

tuf (EF-1) 

fc/f(EF«1) 

fuf(EF-1) 

fu/(EF-1) 

fii/(EF-1) 

fi/f(EF-1) 

tuf (EF-1) 

tuf (EF-1) 

fuf (EF-1) 

tuf (EF-1) 

tuf (EF-1) 
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atpD 
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Table 7. 



Origin of th sequences In the sequence listing (continued). 



SEQ ID NO. 



BacteriaMungal or parasitical species 



Sourc 



Comments* 



488 

489 

490 

491 

492 

493 

494 

495 

496 

497 

498 

499 

500 

501 

502 

503 

504 

5G5 

506 

507 

508 

509 

510 

511 

512 

513 

514 

515 

516 

517 

518 

519 

520 

521 

522 

523 

524 

525 

526 

527 

528 

529 

530 

531 

532 

533 

534 

535 

536 

537 

538 

607 

608 

609 

610 

611 

612 

613 
614 
615 
616 



Pichia anomala 
Pichia anomala 
Rhodotorula minuta 
Rhodotorula mucilaginosa 
Sporobolomyces salmonicolor 
Sporothrix schenckii 
Stephanoascus citerrif 
Trichophyton mentagrophytes 
Wangietla dermatitidis 
Yarrowia lipolytica 
Aspergillus fumigatus 
Blastoschizomyces capltatus 
Candida rugosa 
Coccidioides immitis 
Fusarium oxysponim 
Histoplasma capsulatum 
Paracoccidioides brasiliensis 
BeniGilttum~mam&ffei 
Pichia anomala 
Trichophyton mentagrophytes 
Yarrowia lipolytica 
Babesia bigemina 
Babesia bovis 
Crithidia fasciculata 
Entamoeba histolytica 
Giardia lamblia 
Leishmanla tropica 
Leishmania aethiopica 
Leishmania tropica 
- -Leishmania donovani 
Leishmania infantum 
Leishmania enriettii 
Leishmania gerbilU 
Leishmania hertigi 
Leishmania major 
Leishmania amazonensis 
Leishmania mexicana 

Leishmania tarentolae 

Leishmania tropica 

Neospora caninum 

Trichomonas vaginalis 

Trypanosoma brucei subsp. brucei 

Crithidia fasciculata 

Leishmania tropica 

Leishmania aethiopica 

Leishmania donovani 

Leishmania infantum 

Leishmania gerbilli 

Leishmania hertigi 

Leishmania major 

Leishmania amazonensis 

Enterococcus faecalis 

Enterococcus taecium 

Enterococcus gallinarum 

Haemophilus influenzae 

Staphylococcus epidermidis 

Salmonella choleraesuls subsp. choleraesuis 

serotype Paratyphi A 

Serratia ficaria 

Enterococcus malodoratus 

Enterococcus durans 

Enterococcus pseudoavium 



This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
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This patent 
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This patent 
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This patent 
WO98/20157 
WO98/20157 
WO98/20157 
Database 
WO98/20157 
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This patent 
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This patent 
This patent 
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atpD 

atpD 

tuf(M) 

tuf(M) 

tut{M) 

tuf(M) 

tuf(M) 

tuf(M) 

tuf (M) 

fuf(M) 

fuf(M) 

tuf(M) 

fuf(M) 

fuf(EF-1) 

fuf(EF-1) 

tuf (EF-1) 

ft/f(EF-1) 

firf(EF-l) 

ttf(EF-1) 

*v/(EF-1) 

ft/f(EF-1) 

tt//(EF-1) 

fuf(EF-1) 

fu/(EF-1) 

tuf(EF-l) 

fuf(EF-1) 

fuf(EF-1) 

fuMEF-1) 

fcif(EF-l) 

fcif(EF-l) 

fuf(EF-1) 

fuf(EF-1) 

fuf(EF-1) 

fu/{EF-1) 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

tuf 

tuf 

tuf 

tuf 

tuf 

tut 

tuf 

tuf(C) 
tuf(C) 
tuf(C) 
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Table 7. Origin of the sequences in the sequ nee listing (continued). 



SEQ ID NO. 



Bacterial^ ungal or parasitical species 



Sourc 



Comments* 



5 617 Enterococcus dispar 

618 Enterococcus avium 

619 Saccharomyces cere visiae 

621 Enterococcus faecium 

622 Saccharomyces cerevisiae 
10 623 Cryptococcus neoformans 

624 Candida albicans 

662 Corynebacten'um diphtheriae 

663 Candida catenulata 

665 Saccharomyces cerevisiae 

15 666 Saccharomyces cerevisiae 

667 Trypanosoma cwzi 

668 Corynebacterium glutamicum 

669 Escherichia coli 

670 Helicobacter pylori 

20 671 Clostridium acetobutylicum 

672 Cytophaga lytica 

673 Ehrlichia risticii 

674 V/fcrio cholerae 

675 V/b/io cholerae 

25 676 Leishmania enriettfi 

677 Babesia microti 

678 Cryptococcus neofomians 

679 Cryptococcus neoformans 

680 Cunninghamelta bertholietiae 
30 684 Candida tropicalis 

685 Enterococcus hirae 

686 Chlamydia pneumoniae 

687 Hatobacterium salinarum 

688 Human 

35 689 Plasmodium falciparum 

690 Saccharomyces cerevisiae 

69 1 Schizosaccharomyces pombe 

692 Trypanosoma congolense 

693 Thenmus thennophilus 
40 698 Escherichia coli 

709 Borrelia burgdorferi 

710 Treponema pallidum 

71 1 Chlamydia trachomatis 

7 1 2 Enterococcus faecalis 
45 713 Methanosarcina barken 

714 Methanococcus jannaschii 

715 Porphyromonas gingivalis 

716 streptococcus pneumoniae 

717 Burkhotderia mallei 

50 718 Burkholderia pseudomallei 

719 Clostridium beijerinckii 

720 Clostridium innocuum 

721 Clostridium novyi 

722 Clostridium septicum 
55 723 Clostridium tertium 

724 Clostridium tetani 

725 Enterococcus malodoratus 

726 Enterococcus sulfureus 

727 Lactococcus garvieae 
60 728 Mycoplasma pirum 

729 Mycoplasma salivarium 

730 Neisseria polysaccharea 

731 Salmonella cholera esuis subsp. choteraesuis 
serotype Enteritidis 

65 • 



This patent 

This patent 

Database 

This patent 

This patent 

This patent 

This patent 

WO98/20157 

This patent 

Database 

Database 

This patent 

Database 

Database 

Database 

Database 

Database 

This patent 

This patent 

This patent 

This patent 

This patent 

This patent 

This patent 

This patent 

Database 

Database 

Database 
Database 
Database 
Database 
Database 
Database 
Database 
Database 
Database 
genome project 
genome project 
genome project 
genome project 
Database 
Database 
genome project 
genome project 
This patent 
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This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 
This patent 



fuf(C) 

tuf(C) 

tuf(fA) 

tuf(C) 

fuf(EF-1) 

fu/(EF-1) 

hif(EF-1) 

tuf 

atpD 

fuf(EF-1) 

atpD 

atpD 

tuf 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

tuf 

atpD 

fu^EF-l) 

atpD 

atpD 

atpD 

atpD (V) 

arpD(V) 

atpD{V) 

atpD (V) 

atpD (V) 

atpD (V) 

atpD (V) 

atpD <V) 

atpD (V) 

atpD(V) 

tuf 

afp0(V) 

atpD (V) 

atpD (V) 

alpD (V) 

alpD (V) 

atpD (V) 

a<£D (V) 

alpD (V) 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 

tuf 



91 



CA 02307010 2000-05-19 



Table 7. Origin ol the sequences in the sequenc listing (continu d). 



SEQ ID NO. 



Bacterial .fungal or parasitical species 



Source 



Comments* 



5 732 Salmonella choleraesuis subsp. choleraesuis 

s rotype Gallinarum 

733 Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi B 

734 Salmonella choleraesuis subsp. choleraesuis 
10 serotype Virchow 

735 Serratia grimesii 

736 Clostridium difficile 

737 Burkholderia pseudomallei 

738 Clostridium bifermentans 
15 739 Clostridium beijerinckii 

740 Clostridium difficile 

741 Clostridium ramosum 

742 Clostridium septicum 

743 Clostridium tertium 

20 744 Comamonas acidovorans 

745 Klebsiella pneumoniae subsp. rhinoscleromatis 

746 Neisseria canis 

747 Neisseria cinerea 

748 Neisseria cuniculi 

25 749 Neisseria elongate subsp. etongata 

750 Neisseria flavescens 

751 Neisseria gonorrhoeae 

752 Neisseria gonorrhoeae 

753 Neisseria lactamica 
30 754 Neisseria meningitidis 

755 Neisseria mucosa 

756 Neisseria subflava 

757 Neisseria weaver! 
J758 ~Neisseriaanimaiis 

35 759 Proteus penneri 

760 Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

761 Yersinia pestis 

762 Burkholderia mallei 
40 763 Clostridium sordellii 

764 Clostridium novyi 

765 Clostridium botulinum 

766 Clostridium histolyticum 

767 Peptostreptococcus prevotii 
45 768 Absidia corymbifera 

769 Altemaria aitemata 

770 Aspergillus flavus 

771 Mucor circinelloides 

772 Piedraia hortai 

50 773 Pseudallescheria boydii 

774 Rhizopus oryzae 

775 Scopulariopsis koningii 

776 Trichophyton mentagrophytes 

777 Trichophyton tonsurans 
55 778 Trichosporon cutaneum 

j7S Cladophialophora carrionii 

780 Cunninghamella bertholletiae 

781 Curvularia lunata 

782 Fonsecaea pedrosol 
60 783 Microsporum audouinii 

784 Mucor circinelloides 

785 Phialophora verrucosa 

786 Saksenaea vaslformis 

787 Syncephalastrum racemosum 
65 788 Trichophyton tonsurans 

789 Trichophyton mentagrophytes 
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This patent 
This patent 
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This patent 
This patent 
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This patent 
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tuf 

tuf 

tuf 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

axpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpO 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

fuf(EF-1) 

fu/(EF-1) 

ftjf* (EF-1) 

fu/(EF-1) 

fuf (EF-1) 

ftff(EF-l) 

ttrf(EF-l) 

M<EF-1) 

tlrf(EF-l) 

fuf (EF-1) 

fuA(EF-1) 
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Table 7. Origin of the sequences in the sequence listing (continued). 





SEQ ID NO. 


Bacterial.f ungal or parasitical species 


Source 


Comments* 


5 


790 


Bipolaris hawaiiensis 


This pat nt 


fu/(EF-1) 




791 


Aspergillus fumigatus 


This patent 


tuf (M) 




792 


Trichophyton mentagrophytes 


This patent 


tuf (M) 




827 


Clostridium novyi 


This patent 


atpD (V) 




828 


Clostridium difficile 


This patent 


atpD (V) 


10 


829 


Clostridium septicum 


This patent 


atpD (V) 




830 


Clostridium botulinum 


This patent 


atpD (V) 




831 


Clostridium perfringens 


This patent 


atpD (V) 




832 


Clostridium tetani 


This patent 


atpD (V) 




833 


Streptococcus pyogenes 


Database 


atpD (V) 


15 


834 


Babesia bovis 


This patent 


atpD (V) 




835 


Cryptosporidium parvum 


This patent 


atpD (V) 




836 


Leishmania infantum 


This patent 


atpD (V) 




837 


Leishmania major 


This patent 


atpD (V) 




838 


Leishmania tarentotae 


This patent 


atpD (V) 


20 


839 


Trypanosoma brucei 


This patent 


atpD (V) 




840 


Trypanosoma cruzi 


This patent 


ftrf(EF-l) 




841 


Trypanosoma cruzi 


This patent 


fu/(EF-1) 




842 


Trypanosoma cruzi 


This patent 


fu/(EF-1) 




843 


Babesia bovis 


This patent 


fuf(M) 


25 


844 


Leishmania aethiopica 


This patent 


tuf(lA) 




845 


Leishmania amazonensis 


This patent 


tuf{M) 




846 


Leishmania donovani 


This patent 


tuf(M) 




847 


Leishmania infantum 


This patent 


tuf (M) 




848 


Leishmania enriettii 


This patent 


fu/(M) 


30 


849 


Leishmania ge&illi 


This patent 


tuf{M) 




850 


Leishmania major 


This patent 


tuf{lA) 




851 


Leishmania mexicana 


This patent 


tuf(tA) 




852 


Leishmania tarentotae 


This patent 


tuf(M) 




653 


Trypanosoma cruzi' 


This patent 


fu/(M) 


35 


854 


Trypanosoma cruzi 


This patent 


fuf(M) 




855 


Trypanosoma cruzi 


This patent 






856 


Babesia bigemina 


This patent 


atpD 




857 


Babesia bovis 


This patent 


atpD 




858 


Babesia microti 


This patent 


atpD 


40 


859 


Leishmania guyanensis 


This patent 


atpD 




660 


Leishmania mexicana 


This patent 


atpD 




861 


Leishmania tropica 


This patent 


atpD 




662 


Leishmania tropica 


This patent 


atpD 




663 


Bordetel/a pertussis 


Database 


tuf 


45 


864 


Trypanosoma brucei 


Database 


fuf(EF-1) 




865 


Cryptosporidium parvum 


This patent 


tuf (EF-1) 




866 


Staphylococcus saprophytics 


This patent 


atpD 




867 


Zoogloea ramlgera 


This patent 


atpD 




868 


Staphylococcus saprophytics 


This patent 


tuf 


50 


669 


Enterococcus casseliftavus 


This patent 


tuf 




870 


Enterococcus casseliftavus 


This patent 


tuf 




871 


Enterococcus flavescens 


This patent 


tuf 




872 


Enterococcus gallinarum 


This patent 


tuf 




873 


Enterococcus gallinarum 


This patent 


tuf 


55 


874 


Staphylococcus haemofyticus 


This patent 


tuf 




875 


Staphylococcus epidermidis 


This patent 


tuf 




876 


Staphylococcus epidermidis 


This patent 


tuf 




877 


Staphylococcus epidennidis 


This patent 


tuf 


60 


878 


Staphylococcus epidennidis 


This patent 


tuf 


879 


Enterococcus gallinarum 


This patent 


tuf 




880 


Pseudomonas aeruginosa 


This patent 


tuf 




881 


Enterococcus casseliflavus 


This patent 


tuf 




882 


Enterococcus casseliflavus 


This patent 


tuf 




883 


Enterococcus faecalis 


This patent 


tuf 


65 


884 


Enterococcus faecalis 


This patent 


tuf 




885 


Enterococcus faecium 


This patent 


tuf 
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Table 7. Origin of the sequences In the sequence listing (continued). 



SEQ ID NO. 



BacteriaMungal or parasitical species 



Source 



Comments* 



5 886 Enterococcus faecium 

887 Zoogloea ramigera 

888 Enterococcus iaecalis 

889 Aspergillus fumigatus 

890 Penicillium mameffei 
\0 891 Paecilomyces lilacinus 

892 Penicillium mameffei 

893 Sporothrix schenckii 

894 Malbranchea filamentosa 

895 Paecilomyces lilacinus 
15 896 Aspergillus niger 

897 Aspergillus fumigatus 

898 Penicillium mameffei 

899 Piedraia hortai 

900 Paecilomyces lilacinus 

20 901 Paracoccidioides brasiliensis 

902 Sporothrix schenckii 

303 RepJcUHum mameffei 

904 Curvularia lunata 

905 Aspergillus niger 
25 906 Bipoiaris hawaiiensis 

907 Aspergillus flavus 

908 Altemaria altemata 

909 Penicillium mameffei 

910 Penicillium mameffei 
30 918 Escherichia coll 

929 Bacteroides fragilis 

930 Bacteroides distasonis 

931 Porphyromonas asaccharolytica 

932 Listeria monocytogenes 
35 939 Saccharomyces cerevisiae 

940 Saccharomyces cerevisiae 

941 Cryptococcus humicolus 

942 Escherichia coli 

943 Escherichia coli 
40 944 Escherichia coli 

945 Escherichia coli 

946 Neisseria polysaccharea 

947 Neisseria sicca 

948 Streptococcus mitis 
45 949 Streptococcus mitis 

950 Streptococcus mitis 

951 Streptococcus oralis 

952 Streptococcus pneumoniae 

953 Streptococcus pneumoniae 
50 954 Streptococcus pneumoniae 

955 Streptococcus pneumoniae 

956 Babesia microti 

957 Entamoeba histolytica 

958 Fusobacterium nucleatum subsp. polymorphum 
55 959 Leishmania aethlopica 

960 Leishmania tropica 

961 Leishmania guyanensis 

962 Leishmania donovani 

963 Leishmania hertlgi 
60 964 Leishmania mexlcana 

965 Leishmania tropica 

966 Peptostreptococcus anaerobius 

967 Bordetella pertussis 

968 Bordetella pertussis 

65 969 Enterococcus columbae 



i ms paieni 




I ms paieni 


tt if 

lilt 


I ms paieni 


tuf 

(Iff 


i nis paieni 


atpD 


This patent 


atpD 


This patent 


atpD 


i nis paieni 


atpD 


This patent 


atpD 


1 nis paieni 


atpD 


TL.U r> n l nn i 

1 nis paieni 


atpD 


This patent 


atpD 


This patent 




I nis pateni 


TUT ^tr-l J 


This patent 


*• m4 ICC H \ 

iL// (cr- 1 ) 


This patent 


it// ^er- 1 ; 


This patent 


TUT \c.r- i ) 


TL|_ 

i nis patent 


+ t , 4 tec 1 \ 
TUT ^tr- 1 ; 


I his patent 


*■* »4 tec 1 \ 

TUT (tr-1 ) 


This patent 


tut (M) 


This pateni 


TUT (M) 


This patent 


*, ,4 ftiA\ 

TUT (M) 


This patent 


TUT \M) 


This patent 


TUT {M) 


This patent 


TUT (M) 


This patent 


TUT (M) 


Database 


rec/A 


This patent 


atpu (V) 


This patent 


atpu (v) 


This patent 


afpD (V) 


This patent 


tuf 


Database 


a- m — Jr j \ 

rec/l (nadov 


Database 


recA (Dmcl ) 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpD 


This patent 


atpu 


This patent 


atpD 


This patent 


arpu (V) 


i nis patent 


aipu \v) 


This patent 


atpD (V) 


This patent 


afr>D(V) 


This patent 


a<)pD(V) 


This patent 


alpD (V) 


This patent 


atpD (V) 


This patent 


afpD (V) 


This patent 


atpD (V) 


This patent 


afpD (V) 


This patent 


alpD (V) 


This patent 


tuf 


This patent 


tuf 


This patent 


tuf 



94 



CA 02307010 2000-05-19 



Table 7. Origin f the sequences In th sequence listing (continued). 



SEQ ID NO. Bacterial.fungal or parasitical species Source Comments* 



970 


Enterococcus flavescens 




This pat nt 


tuf 


971 


Streptococcus pneumoniae 




This patent 


tuf 


972 


Escherichia coli 




This patent 


tuf 


973 


Escherichia coli 




This patent 


tuf 


974 


Escherichia coli 




This patent 


tuf 


975 


Escherichia coli 




This patent 


tuf 


976 


Mycobacterium avium 




This patent 


tuf 


977 


Streptococcus pneumoniae 




This patent 


tuf 


978 


Mycobacterium gordonae 




This patent 


tuf 


979 


Streptococcus pneumoniae 




This patent 


tuf 


980 


Mycobacterium tuberculosis 




This patent 


tuf 


981 


Staphylococcus wameri 




This patent 


tuf 


982 


Streptococcus mitis 




This patent 


tuf 


983 


Streptococcus mitis 




This patent 


tuf 


984 


Streptococcus mitis 




This patent 


tuf 


985 


Streptococcus oralis 




This patent 


tuf 


986 


Streptococcus pneumoniae 




This patent 


tuf 


987 


Enterococcus hirae 




This patent 


tuf(C) 


988 


Enterococcus mundtii 




This patent 


tuf(C) 


989 


Enterococcus raffinosus 




This patent 


tuf(C) 


990 


Bacillus anthracis 




This patent 


recA 


991 


Prevotella melaninogenica 




This patent 


recA 


992 


Enterococcus casseliflavus 




This patent 


tuf 


993 


Streptococcus pyogenes 




Database 


speA 


1002 


Streptococcus pyogenes 




genome project 


tuf 


1003 


Bacillus cere us 




This patent 


recA 


1004 


Streptococcus pneumoniae 




This patent 


pbpla 


1005 


Streptococcus pneumoniae 




This patent 


pbpla 


1006 


Streptococcus pneumoniae 




This patent 


pbpla 


1007 


Streptococcus pneumoniae 




This patent 


pbpla 


1008 


: ^Streptococcus pneumoniae 


— 


This;patent 


pbpla 


1009 


Streptococcus pneumoniae 




This patent 


pbpla 


1010 


Streptococcus pneumoniae 




This patent 


pbpla 


1011 


Streptococcus pneumoniae 




This patent 


pbpla 


1012 


Streptococcus pneumoniae 




This patent 


pbpla 


1013 


Streptococcus pneumoniae 




This patent 


pbpla 


1014 


Streptococcus pneumoniae 




This patent 


pbpla 


1015 


Streptococcus pneumoniae 




This patent 


pbpla 


1016 


Streptococcus pneumoniae 




This patent 


pbpla 


1017 


Streptococcus pneumoniae 




This patent 


pbpla 


1018 


Streptococcus pneumoniae 




This patent 


pbpla 


1019 


Streptococcus pneumoniae 




This patent 


pbp2b 


1020 


Streptococcus pneumoniae 




This patent 


pbp2b 


1021 


Streptococcus pneumoniae 




This patent 


pbp2b 


1022 


Streptococcus pneumoniae 




This patent 


pbp2b 


1023 


Streptococcus pneumoniae 




This patent 


pbp2b 


1024 


Streptococcus pneumoniae 




This patent 


pbp2b 


1025 


Streptococcus pneumoniae 




This patent 


pbp2b 


1026 


Streptococcus pneumoniae 




This patent 


pbp2b 


1027 


Streptococcus pneumoniae 




This patent 


pbp2b 


1028 


Streptococcus pneumoniae 




This patent 


pbp2b 


1029 


Streptococcus pneumoniae 




This patent 


pbp2b 


1030 


Streptococcus pneumoniae 




This patent 


pbp2b 


1031 


Streptococcus pneumoniae 




This patent 


pbp2b 


1032 


Streptococcus pneumoniae 




This patent 


pbp2b 


1033 


Streptococcus pneumoniae 




This patent 


pbp2b 


1034 


Streptococcus pneumoniae 




This patent 


pbp2x 


1035 


Streptococcus pneumoniae 




This patent 


pbp2x 


1036 


Streptococcus pneumoniae 




This patent 


pbp2x 


1037 


Streptococcus pneumoniae 




This patent 


pbp2x 
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Table 7. Origin of the s quences in the s quence listing (continued). 



SEQ ID NO. 



Bacterial.fungal or parasitical species 



Source 



Comments* 



5 1 038 Streptococcus pneumoniae 

1039 Streptococcus pneumoniae 

1 040 Streptococcus pneumoniae 

1 041 Streptococcus pneumoniae 

1 042 Streptococcus pneumoniae 
10 1043 Streptococcus pneumoniae 

1044 Streptococcus pneumoniae 

1045 Streptococcus pneumoniae 

1 046 Streptococcus pneumoniae 

1 047 Streptococcus pneumoniae 
1 5 1 048 Streptococcus pneumoniae 

1 049 Enterococcus faecium 

1 050 Enterococcus galiinarum 

1051 Enterococcus faecium 

1 052 Enterococcus faecium 
20 1 053 Enterococcus faecium 

1 054 Enterococcus faecaiis 

1Q55 Enterococcus gaiiinarum 

1056 Enterococcus faecium 

1057 Enterococcus flavescens 
25 1058 Enterococcus galiinarum 

1 059 Enterococcus gaiiinarum 

1 060 Enterococcus casseliffavus 

1 061 Enterococcus casseiifiavus 

1 062 Enterococcus casseiifiavus 
30 1063 Enterococcus casseiifiavus 

1064 Enterococcus flavescens 

1 065 Enterococcus flavescens 

1 066 Enterococcus flavescens 

1067 —Enterococcus faecium 
35 1068 Enterococcus faecium 

1 069 Enterococcus faecium 

1 070 Enterococcus faecaiis 

1 07 1 Enterococcus galiinarum 

1072 Enterococcus faecium 
40 1 073 Enterococcus flavescens 

1074 Enterococcus faecium 

1 075 Enterococcus galiinarum 

1076 Escherichia coli 

1 077 Escherichia coli 

45 1 093 Staphylococcus saprophytics 

1117 Enterococcus faecium 

1 1 38 Enterococcus galiinarum 

1 1 39 Enterococcus faecium 

1 1 40 Enterococcus casseiifiavus 
50 1141 Enterococcus faecium 

1 1 69 Streptococcus pneumoniae 

1 1 72 Streptococcus pneumoniae 

1 1 73 Streptococcus pneumoniae 
1 1 78 Staphylococcus aureus 

55 1 1 83 Streptococcus pneumoniae 

1 1 84 Streptococcus pneumoniae 

1 1 85 Streptococcus pneumoniae 

1 1 86 Streptococcus pneumoniae 

1 1 87 Streptococcus pneumoniae 
60 



This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


pbp2x 


This patent 


vanA 


This patent 


vanA 


This patent 


vanA 


This patent 


vanA 


This patent 


vanA 


This patent 


vanA 


This patent 


vanA 


Ttiispaient 


vanA 


This patent 


vanA 


This patent 


vanC1 


This patent 


vanC1 


This patent 


vanC2 


This patent 


vanC2 


This patent 


vanC2 


This patent 


vanC2 


This patent 


vanC3 


This Datent 


vanC3 


This Datent 


vanC3 




vanXY 


This Datent 


vanXY 


This natent 


vanXY 


This natant 

1 1119 paid il 


vanXY 


This natont 


vanXY 


This natent 


vanXY 


This patent 


vanXY 


This patent 


vanXY 


This Datent 


vanXY 


Database 


stx-i 


Database 


StX2 


This patent 


unknown 


Database 


vanB 


Database 


vanC\ 


Database 


vanA 


Database 


vanC2 


Database 


vanHAXY 


Database 


pbpla 


Database 


pbp2b 


Database 


pbp2x 


Database 


mecA 


Database 


hexA 


This patent 


hexA 


This patent 


hexA 


This patent 


hexA 


This patent 


hexA 
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Table 7. Origin of the sequences in the sequence listing (continued). 





SEQ ID NO. 


Bactenal.fungal or parasitical species 


OUUI V/tJ 


Hnmrnfink* 

vUl 1 II 1 lol Ito 


5 


1188 


Streptococcus oralis 


This patent 


hexA 




1189 


Streptococcus mitis 


This patent 


hexA 




1190 


Streptococcus mitis 


This patent 


hexA 




1191 


Streptococcus mitis 


This patent 


hexA 




1198 


Staphylococcus saprophyticus 


This patent 


unknown 


10 


1215 


Streptococcus pyogenes 


Database 


PCP 




1230 


Escherichia coil 


Database 


ft//(EF-G) 




1242 


Enterococcus faecium 


Database 


ddi 




1243 


Enterococcus faecalis 


Database 


mtIF, mttO 


15 


1244 


Staphylococcus aureus subs p. aureus 


This patent 


unknown 



• tui indicates rufsequences, tuf (C) indicates tuf sequences divergent from main (usually A and B) copies of the elongation factor-Tu, 
tuf (EF-1) indicates tuf sequences of the eukaryotic type (elongation factor 1a), tuf (M) Indicates tuf sequences from organellar 
(mostly mitochondrial) origin. 
20 atpD indicates atpD sequences ot the F-type, atpD (V) indicates atpD sequences of the V-type. 

recA indicates recA sequences, /eo*(Rad51) Indicates radSl sequences or homologs and reo4(Dmc1) Indicates dmd sequences or 

homologs. 
speA indicates speA sequences. 
hexA indicates hexA sequences. 
25 mecA indicates mecA sequences. 

vanA indicates vanA sequences, vanB indicates vanB sequences, vanC1 indicates vanC1 sequences, vanC2 indicates vanC2 

sequences, vanC3 indicates vanC3 sequences, vanX Vindicates vanXKsequences, vanHAXY indicates vanHAXY sequences, 
pop fa indicates pop fa sequences, pbp2b indicates pbp2b sequences, pbp2x indicates pbp2x sequences. 
pep indicates pep sequences. 
30 stxi indicates six, sequences, stx 2 indicates sbc 2 sequences, 
unknown indicates an unknown gene. 
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Table 8. Bacterial sp cies used t t st the sp cificity of the Streptococcus aga/acf/ae-specific 
amplification primers derived from tuf sequences. 



Strain 



Reference number 



Strain 



Reference number 



Streptococcus acidominimus ATCC 51 726 

Streptococcus agalactiae ATCC 12403 

Streptococcus agalactiae ATCC 12973 

1 0 Streptococcus agalactiae ATCC 13813 

Streptococcus agalactiae ATCC 27591 

Streptococcus agalactiae CDCs 1 073 

Streptococcus anginosus ATCC 27335 

Streptococcus anginosus ATCC 33397 

1 5 Streptococcus bovis ATCC 3331 7 

Streptococcus anginosus ATCC 27823 

Streptococcus cricetus ATCC 19642 

Streptococcus cristatus ATCC 5 1 1 00 

Streptococcus downei ATCC 33748 

20 Streptococcus dysgalactiae A "[p c 43° 78 
Streptococcus equTs uBsp. equi A i CC~952S 

Streptococcus ferus ATCC 33477 

Streptococcus gordonii ATCC 1 0558 

Streptococcus macacae ATCC 3591 1 

25 Streptococcus mitis ATCC 49456 

Streptococcus mutans ATCC 25175 

Streptococcus oralis ATCC 35037 

Streptococcus parasanguinis ATCC 15912 

Streptococcus parauberis DSM 6631 

30 Streptococcus pneumoniae ATCC 27336 

Streptococcus pyogenes ATCC 19615 

Streptococcus ratth ATCC 19645 

Streptococcus salivarius ATCC 7073 

Streptococcus sanguinis ATCC 10556 

35 Streptococcus sobrinus ATCC 27352 

Streptococcus suis ATCC 43765 

Streptococcus uberis ATCC 1 9436 

Streptococcus vestubutaris ATCC 491 24 



Bacteroides caccae ATCC 431 85 

Bacteroides vulgatus ATCC 8482 

Bacteroides fragilis ATCC 25285 

Candida albicans ATCC 1 1 006 

Clostridium innoculum ATCC 14501 

Clostridium ramosum ATCC 25582 
Lactobacillus casei subsp. case/ ATCC 393 

Clostridium septicum ATCC 1 2464 

Corynebacterium cervicis N CTC 1 0604 

Corynebacterium genitalium ATCC 33031 

Corynebacterium urealyticum ATCC 43042 

Enterococcus faecalis ATCC 2921 2 

Enterococcus faecium ATCC 19434 

Eubacterium lentum ATCC 43055 

Eobacterrum notfuium ATGG 33G9S 

Gardnerella vaginalis ATCC 1401 8 

Lactobacillus acidophilus ATCC 4356 

Lactobacillus crispatus ATCC 33820 

Lactobacillus gasseri ATCC 33323 

Lactobacillus johnsonii ATCC 33200 

Lactococcus lactis subsp. lactis ATCC 19435 

Lactococcus lactis subsp. lactis ATCC 1 1454 

Listeria innocua ATCC 33090 

Micrococcus luteus ATCC 9341 

Eschenchia coli ATCC 25922 

Micrococcus tylae ATCC 27566 

Porphyromonas asaccharolytica ATCC 25260 

Prevotella corporis ATCC 33547 

Prevotella melanogenica ATCC 25845 

Staphylococcus aureus ATCC 1 3301 

Staphylococcus epidermidis ATCC 14990 

Staphylococcus saprophytics ATCC 1 5305 



40 
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Tabl 9. Bacterial sp cies used to test the specificity of the Streptococcus aga/actfae-specific 
amplification primers derived from atpD sequences. 



Strain Reference number Strain Reference number 



Streptococcus 


acidominimus 


ATCC 


51726 


Streptococcus 


gordonii 


ATCC 10558 


Streptococcus 


agalactiae 


ATCC 


12400 


Streptococcus 


macacae 


ATCC 35911 


Streptococcus 


agalactiae 


ATCC 


12403 


Streptococcus 


mitis 


ATCC 49456 


Streptococcus 


agalactiae 


ATCC 


12973 


Streptococcus 


/nutans 


ATCC 25175 


Streptococcus 


agalactiae 


ATCC 


13813 


Streptococcus 


oralis 


ATCC 35037 


Streptococcus 


agalactiae 


ATCC 


27591 


Streptococcus 


parasanguinis 


ATCC 15912 


Streptococcus 


agalactiae 


CDCs 


;-1073 


Streptococcus 


parauberis 


DSM 6631 


Streptococcus 


anginosus 


ATCC 


27335 


Streptococcus 


pneumoniae 


ATCC 27336 


Streptococcus 


anginosus 


ATCC 


27823 


Streptococcus 


pyogenes 


ATCC 19615 


Streptococcus 


bovis 


ATCC 


33317 


Streptococcus 


ratti 


ATCC 19645 


Streptococcus 


cricetus 


ATCC 


19642 


Streptococcus 


salivarius 


ATCC 7073 


Streptococcus 


chstatus 


ATCC 


51100 


Streptococcus 


sanguinis 


ATCC 10556 


Streptococcus 


downei 


ATCC 


33748 


Streptococcus 


sobrinus 


ATCC 27352 


Streptococcus 


dysgaiactiae 


ATCC 


43078 


Streptococcus 


suis 


ATCC 43765 


Streptococcus 


equi subsp. equi 


ATCC 


;9528 


Streptococcus 


uberis 


ATCC 19436 


Streptococcus 


ferus 


ATCC 


33477 


Streptococcus 


vestibularis 


ATCC 49124 
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Table 10. Bacterial species used t test th specificity of the fnferococcu^specific amplification 
primers d rived from tufs quences. 



Strain 



Reference number 



Strain 



Reference number 



5 Gram-positive species (n=74) 

Abiotrophia adiacens ATCC 491 76 

Abiotrophia defectiva ATCC 491 75 

Bacillus cereus ATCC 1 4579 

Bacillus subtills ATCC 27370 

10 Bifidobacterium adolescentis ATCC 27534 

Bifidobacterium breve ATCC 1 5700 

Bifidobacterium dentium ATCC 27534 

Bifidobacterium longum ATCC 1 5707 

Clostridium perfringens ATCC 31 24 

1 5 Clostridium septicum ATCC 1 2464 

Corynebacterium aquaticus ATCC 1 4665 

Corynebacterium ATCC 10700 
pseudodiphtheriticum 
Er.tsrccocc'JS sviurr, 

20 Enterococcus casseliflavus ATCC 25788 

Enterococcus cecorum ATCC 431 99 

Enterococcus columbae ATCC 51 263 

Enterococcus dispar ATCC 51 266 

Enterococcus durans ATCC 1 9432 

25 Enterococcus faecalis ATCC 2921 2 

Enterococcus faecium ATCC 1 9434 

Enterococcus flavescens ATCC 49996 

Enterococcus gallinarum ATCC 49573 

Enterococcus hirae - ATCC 8044 

30 Enterococcus mafodoratus - ATCC 43197 

Enterococcus mundtii ATCC 431 86 

Enterococcus pseudoavium ATCC 49372 

Enterococcus raffinosus ATCC 49427 

Enterococcus saccharolyticus ATCC 43076 

35 Enterococcus solitarius ATCC 49428 

Enterococcus sulfureus ATCC 49903 

Eubacterium lentum ATCC 49903 

Gemella haemolysans ATCC 1 0379 

Gemella morbillorum ATCC 27842 

40 Lactobacillus acidophilus ATCC 4356 

Leuconostoc mesenteroides ATCC 1 9225 

Listeria grayi ATCC 19120 

Listeria grayi ATCC 1 91 23 

45 " 



Listeria innocua ATCC 33090 

Listeria ivanovii ATCC 19119 

Listeria monocytogenes ATCC 1 531 3 

Listeria seeligeri ATCC 35967 

Micrococcus luteus ATCC 9341 

Pediococcus acidilacti ATCC 3331 4 

Pediococcus pentosaceus ATCC 3331 6 

Peptococcus niger ATCC 27731 

Peptostreptococcus anaerobius ATCC 27337 

Peptostreptococcus indolicus ATCC 29247 

Peptostreptococcus micros ATCC 33270 

Propionibacterium acnes ATCC 691 9 

Staphylococcus aureus ATCC 43300 

Staphylococcus capiiis ATCC 27840 

Staphylococcus epidermidis ATCC 14990 

Staphylococcus haemolyticus ATCC 29970 

Staphylococcus hominis ATCC 27844 

Staphylococcus tugdunensis ATCC 43809 

Staphylococcus saprophytics ATCC 1 5305 

Staphylococcus simulans ATCC 27848 

Staphylococcus wameri ATCC 27836 

Streptococcus agalactiae ATCC 1 381 3 

Streptococcus anginosus ATCC 33397 

Streptococcus bovis . ATCC 3331 7 

Streptococcus constellatus ATCC 27823 

Streptococcus cristatus ATCC 51 1 00 

Streptococcus intermedius ATCC 27335 

Streptococcus mitis ATCC 49456 

Streptococcus mitis ATCC 3639 

Streptococcus mutans ATCC 271 75 

Streptococcus parasanguinis ATCC 1 591 2 

Streptococcus pneumoniae ATCC 27736 

Streptococcus pneumoniae ATCC 6303 

Streptococcus pyogenes ATCC 1 961 5 

Streptococcus salivarius ATCC 7073 

Streptococcus sanguinis ATCC 1 0556 

Streptococcus suis ATCC 43765 
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Table 10. Bacterial species used to test the specificity of the Enterococcus genus-specific 
amplification primers derived fr m tuf sequences (continued). 



Strain 



Relerence number 



Strain 



Reference number 



5 Gram-negative species (n=39) 

Acidominococcus fermentans ATCC 2508 

Acinetobacter baumannii ATCC 19606 

Alcafigenes faecalis ATCC 8750 

Anaerobiospirillum ATCC 29305 

10 succiniproducens 

Anaerorhabdus furcosus ATCC 25662 

Bacteroides distasonis ATCC 8503 
Bacteroides thetaiotaomicron ATCC 29741 

Bacteroides vulgatus ATCC 8482 

1 5 Bordetella pertussis LS PQ 3702 

Bulkholderia cepacia LS PQ 22 1 7 

Butyvibrio fibrinosolvens ATCC 1 9 1 7 1 

Cardiobacterium hominis ATCC 15826 

Citrobacter freundii ATCC 8090 

20 Desulfovibrio vulgaris ATCC 29579 

Edwardsiellae tarda ATCC 15947 

Enterobacter cloacae ATCC 13047 

Escherichia coli ATCC 25922 

Fusobacterium russii ATCC 25533 

25 Haemophilus influenzae ATCC 9007 



Hafnia atvei ATCC 13337 

Klebsiella oxytoca ATCC 1 31 82 

Meganomonas hypermegas ATCC 25560 

Mitsukoella multiacidus ATCC 27723 

Moraxella catarrhalis ATCC 43628 

Morganella morganii ATCC 25830 

Neisseria meningitidis ATCC 1 3077 

Pasteurella aerogenes ATCC 27883 

Proteus vulgaris ATCC 13315 

Providencia alcalifaciens ATCC 9886 

Providencia rettgeri ATCC 9250 

Pseudomonas aeruginosa ATCC 27853 

Salmonella typhimurium ATCC 14028 

Serratia marcescens ATCC 1 3880 

Shigella flexneri ATCC 12022 

Shigella sonnei ATCC 29930 
Succinivibrio dextrinosolvens ATCC 19716 

Tissierella praeacuta ATCC 25539 

Veilioneila parvula ATCC 1 0790 

Yersinia enterocolitica ATCC 961 0 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available In 
public databases. 



Species 



Strain 



Accession number 



Coding gene* 



Bacteria 

Agrobacterium tumefaciens 

Agrobacterium tumefaciens 
5 Agrobacterium tumefaciens 

Anacystis niduians 

Aquifex aeolicus 

Aquifex aeolicus 

Aquifex pyrophilus 
10 Aquifex pyrophilus 

Archaeoglobus fulgidus 

Bacillus halodurans 

Bacillus halodurans 

Bacillus stearothermophilus 
X5 Bacillus subtilis 

Bacillus subtilis 

Bacillus subtilis 

Bacteroides forsythus 

Bacteroides fragilis 
20 Bordetella bronchiseptica 

Bordetella pertussis 

Borrelia burdorgferi 

Borrelia burgdorferi 

Brevibacterium linens 
25 Buchnera aphidicola 

Campylobacter jejuni 

Chlamydia pneumoniae 

Chlamydia pneumoniae 

Chlamydia trachomatis 
30 Chlamydia trachomatis 

Chlamydia trachomatis 

Chlamydia trachomatis 

Chlorobium vibrioforme 

Chloroflexus aurantiacus 
3 5 Clostridium acetobutylicum 

Clostridium difficile 

Corynebacterium glutamicum 

Corynebacterium glutamicum 

Coxielfa burnetii 
40 Cytophaga lytica 

Deinococcus radiodurans 

Deinococcus radiodurans 

Deinonema sp. 

Eikenella corrodens 
45 Eikenella corrodens 

Enterococcus faecalis 

Escherichia coli 

Escherichia coli 

Escherichia coli 
50 Escherichia coli 

Escherichia coli 

Escherichia coli 

Fervidobacterium islandicum 

Fibrobacter succinogenes 





X99673 


tuf 




X99673 


fuf (EF-G) 




VQQS74 


tuf 




X 17442 


tuf 


VF5 

V 1 %J 


AE000669 


tuf 


V l «J 


AE000669 


fuf(EF-G) 




Genome project 2 


fuf(EF-G) 




Y15787 


tuf 




Genome project 2 


tuf (EF-G) 




ABO 17508 


tuf 




AB01 7508 


fuf(EF-G) 


ppm piftd 


AJ 000260 




1 DO 


D64127 


fuf 


1 DO 


D64127 


fuf(EF-G) 




Z99104 


fuf 


atpp aw*7 


AB03546G 


tuf 




J 


tuf 


□ pen 


vaonuiTits piujtJLfi 


tuf 


i onama i 




tuf 


DOI 




tuf 




AFnm i^r 


tuf (EF-G) 

Iwl ^ I— • V* J 


Uolvl <£U*l£0 


Y7RRR'* 


tuf 


An 
Ap 




tuf 


IMO I U i il bo 


VI 71 C7 
Y I MO/ 


tuf 


L/WL029 


a enm cac 
AfcUUl 3?£ 


ft if 

IUI 


UW LU29 


R*5Q 


tuf /FF-f^ 

lui V3J 




M74991 


ttlf 


n/i iva/ o/oy 
U/UW-O/OA 


APnm *317 

ntUU I O I / 




n/i ha/ q/py 


ntUU 1 OUO 


tuf 

IUI 


r/lw Oar I O 


L22216 


tuf 


nCM ORQ 
UoM ^OO 




tuf 


UoM DJD 


Y7RRR*5 


tuf 






tuf 


RQH. 
OOU 


f^flnnmo nrniorr 

VJtJllUHItJ [Ji V^j OU I 


tuf 


Aon 1Q 


Y77nid 


tuf 


Nil I.OQQ 




tuf 


(Nine MU8 pna.se i 


Ml 1 OODlrr 


tuf 


UOIVI CUJ!7 


X77035 


tuf 


R1 


AE001891 


tuf (EF-G) 


R1 


Genome project 2 


tuf 






tuf 


ATCC 23834 


Z12610 


tuf 


ATCC 23834 


Z12610 


tuf (EF-G) 




Genome project 2 


fuf(EF-G) 




J01690 






J01717 


fuf 




X00415 


faf (EF-G) 




X57091 


tuf 


K-12 MG1655 


U00006 


tuf 


K-12MG1655 


U00096 


tuf 


DSM 5733 


Y15788 


tuf 


S85 


X76866 


tuf 
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Table 11. Microbial species f r whi h tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 





Species 


Strain 


Accession number 


Coding gene 11 




r iBVODSCTenum remgeneum 


L/OlVI 1 OOtH 


Y7COR7 


i uf 




riexistipes sinusarauici 




AOJJHD 1 


tuf 

l Li I 




\jlOBOD3Cl6f VtOtaCGUS 


POP 




tilf 




GloBOtheco sp. 


rbb OOU1 




it if 


5 


Haemophilus acunomycetemcomnans 


MM DOl 


vaenome project 


it tf 
IUT 




Haemophilus ducreyi 


ocnnn 


ArUo/*H *t 


IUT ^tr-Va; 




Haemophilus influenzae 


Rd 


U 32739 


tut 




Haemophilus influenzae 


HO 


U^/o9 


TUT (fcr-la) 




Hatobacten'um marismortui 




A IDDf / 


rur 


10 


Helicobacter pylori 


26695 


AE00051 1 


tut 




Helicobacter pylori 


J99 


Afc.001 539 






Helicobacter pylori 


J99 


AE001 541 


tut 




Herpetosiphon aurantiacus 


Hpgal 


V7CQCO 

X76868 


tut 




Lactobacillus paracasei 




CI OOOO 


it at 
TUT 


15 


Leptospira interrogans 




Ari i o^oo 


ft if 
TUT 




Methanobacterium thermoautrophicum 


delta n 


AbUUUo/ / 


4t tf 

TUT 




Methanococcus jannaschii 


A 1 4o0b / 


I IC7/1 QC 

Ubf4ob 


*t tf 
TUT 




Methanococcus jannaschii 


r%CM OCC1 


laenome project 


ft tf /cc n\ 
TUT (br-\jj 




Methanococcus vannielii 




AUOb9o 


ft tf 
TUT 


20 


Micrococcus luteus 


IPO 3333 


Ml 7788 


tut (th-G) 




Micrococcus luteus 


IFO 3333 


Ml 7788 


tut 




Moraxella sp. 


TAC II 25 


AJ249258 


tut 




Mycobacterium avium 


104 


Genome project 2 


tut 




Mycobacterium bovis 


AF21 22/97 


Genome project 2 


tut 


25 


Mycobacterium leprae 




L13276 


A, ,S 

tut 




Mycobacterium leprae 




Z14314 


tut 




Mycobacterium leprae 




Z14314 


ft/r (EF-G) 




Mycobacterium leprae 


Thai 53 


D 13869 


ftff 




Mycobacterium tuberculosis 


Erdmann 


S40925 




30 


Mycobacterium tuberculosis 


H37Rv 


A ■ f\ U f\ A*\ 

AL021943 


fur (EF-G) 




Mycobacterium tuberculosis 


H37Rv 


Z84395 


fuf 




Mycobacterium tuberculosis 


y42 


AD000005 


fuf 




Mycoplasma capricolum 


PG-31 


X16462 


tuf 




Mycoplasma genitalium 


G37 


U39732 


tut 


35 


Mycoplasma genitalium 


G37 


Genome project 


tuf (EF-G) 




Mycoplasma hominis 




X571 36 


tuf 




Mycoplasma hominis 


PG21 


M57675 


tuf 




Mycoplasma pneumoniae 


M129 


AE000019 


tuf 




Mycoplasma pneumoniae 


M129 


AE000058 


fuf (EF-G) 


40 


Neisseria gonorrhoeae 


MS1 1 


L36380 


tuf 




Neisseria gonorrhoeae 


MS1 1 


L36380 


fuf (EF-G) 




Neisseria meningitidis 




Genome project 2 


fuf (EF-G) 




Neisseria meningitidis 


Z2491 


Genome project 2 


tuf 




Peptococcus niger 


DSM 20745 


X76869 


tuf 


45 


Porphyromonas gingivalis 


W83 


Genome projecr 


tuf 




Phormidium ectocarpi 


PCC 7375 


U09443 


tuf 




Planobispora rosea 


ATCC 53773 


U67308 


tuf 




Plectonema boryanum 


PCC 73110 


U09444 


tuf 


50 


Porphyromonas gingivalis 


W83 


Genome project 2 


tuf 


Porphyromonas gingivalis 


FDC 381 


AB035461 


tuf 




Porphyromonas gingivalis 


W83 


AB035462 


tuf 




Porphyromonas gingivalis 


SUNY1021 


AB035463 


tuf 




Porphyromonas gingivalis 


A7A1-28 


AB035464 


tuf 




Porphyromonas gingivalis 


ATCC 33277 


AB035465 


tuf 


55 


Porphyromonas gingivalis 


ATCC 33277 


AB035471 


tuf (EF-G) 
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Tabl 11. Microbial speci s f r which tuf and/or atpD and/ r recA sequences are availab! in 
public databases (continu d). 



Species 


Strain 


Accession number 


Coding gene* 


Prochforothrix hollandica 




U09445 


tuf 


Pseudomonas aeruginosa 


PAO-1 


Genome project 2 


tuf 


Pyrococcus abyssi 


Orsay 


AJ248285 


tuf 


Rickettsia prowazekii 


Madrid E 


AJ235272 


tuf 


Rickettsia prowazekii 


Madrid E 


Z54171 


tuf (EF-G) 


Salmonella typhimurium 




X64591 


to/ (EF-G) 


Salmonella typhimurium 


LT2 trpE91 


X55116 


tuf 


Salmonella typhimurium 


LT2 trpE91 


X55117 


tuf 


Serpulina hyodysentenae 


B204 


U51635 


tuf 


Serratia marcescens 




AF058451 


tuf 


Shewanella putida 




Genome projecr 


tuf 


Shewanella putrefaciens 


DSM 50426 


_i 


tuf 


Shewanella putrefaciens 


MR-1 


Genome project 2 


tuf 


Splrochaeta aurantia 


DSM 1902 


X76874 


tuf 


Staphylococcus aureus 

Siigmatelia^vrarttiaca 




AJ237696 


tuf (EF-G) 


DW4 


X82820 


tuf 


Stigmatella aurantiaca 


Sga1 


X76870 


tuf 


Streptococcus mutans 


GS-5 Kuramltsu 


U75481 


tuf 


Streptococcus mutans 


UAB159 


Genome project 2 


tuf 


Streptococcus oralis 


NTCC 11427 


P331701 


tuf 


Streptococcus pyogenes 




Genome project 2 


tuf (EF-G) 


Streptococcus pyogenes 


M1-GAS 


Genome project 2 


tuf 


Streptomyces aureofaciens 


ATCC 10762 


AF007125 


tuf 


Streptomyces cinnamoneus 


Tue89 


X98831 


tuf 


Streptomyces coeiicolor 


A3(2) 


AL031013 


tuf (EF-G) 


Streptomyces coeiicolor 


M145 


X77039 


tuf 


Streptomyces coilinus 


BSM 40733 


S79408 


tuf 


Streptomyces netropsis 


Tu1063 


AF153618 


tuf 


Streptomyces ramocissimus 




X67057 


tuf 


Streptomyces ramocissimus 




X67057 


fuf (EF-G) 


Synechococcus sp. 




X17442 


tuf (EF-G) 


Synechocystis sp. 




X65159 


tuf (EF-G) 


Synechocystis sp. 


PCC 6803 


D90913 


tuf 


Taxeobacter occeaius 


Myx 2105 


X77036 


tuf 


Thermoplasma acidophilum 


DSM 1728 


X53866 


tuf 


Thermotoga maritima 




Genome project 2 


tuf (EF-G) 


Thermotoga maritima 




M27479 


tuf 


Thermus aquaticus 


EP 00276 


X66322 


tuf 


Thermus thermophilus 


HB8 


X16278 


tuf (EF-G) 


Thermus thermophilus 


HB8 


X05977 


tuf 


Thermus thermophilus 


HB8 


X06657 


tuf 


Thiomonas cuprina 


DSM 5495 


U78300 


tuf 


Thiomonas cuprina 


UoM o4yo 


i i7Qonn 
U/OOUU 


iuj ^fcr-ijij 


Thiomonas cuprina 


H065 


X76871 


tuf 


Treponema pallidum 






ft it 
XUl 


Treponema pallidum 




AE001248 


tuf (EF-G) 


Streptomyces ramocissimus 




X67058 


tuf 


Ureapiasma urealyticum 


ATCC 33697 


Z34275 


tuf 


Vibrio cholerae 


N 16961 


TIGR2 


tuf 


Wolinelia succinogenes 


DSM 1740 


X76872 


tuf 


Yersinia pestis 


CO-92 


Genome project 2 


tuf 


Fungi 








Absidia glauca 


CBS 101.48 


X54730 


ruf(EF-1) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



104 



CA 02307010 2000-05-19 



Table 11. Microbial species for which tuf and/or atpD and/or recA sequences ar available in 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



Arxula adeninivorans 
Aspergillus oryzae 
Aureobasidium pullulans 
Candida albicans 
5 Candida albicans 

Cryptococcus neoformans 
Cryptococcus neoformans 
Eremothecium gossypii 
Fusarium oxysporum 

10 Histoplasma capsulatum 
Podospora anserina 
Puccinia graminis 
Rhizomucor racemosus 
Rhizomucor racemosus 

1 5 Rhizomucor racemosus 
Rhodotorula mucilaginosa 
Saccharomyces cerevisiae 
Saccharomyces cerevisiae 
Saccharomyces cerevisiae 

20 Saccharomyces cerevisiae 

Schizosaccharomyces pombe 
Trichoderma reesei 
Yarrowia lipotytica 

25 Parasites 

Blastocysts hominis 

Eimeria tenelia 

Giardia lamblia 
3 0 Kentrophoros sp. 

Leishmania amazonensis 

Leishmania braziliensis 

Onchocerca volvulus 

Porphyra purpurea 
3 5 Plasmodium berghei 

Plasmodium falciparum 

Plasmodium knowlesi 

Toxoplasma gondii 

Trypanosoma cruzi 
40 Trypanosoma brucei 

Human and plants 

Arabidopsis thaliana 
45 Glycine max 

Glycine max 

Glycine max 

Glycine max 

Homo sapiens 
50 Pyramimonas disomata 

Bacteria 



Ls3 

KBN616 

R106 

SC5314 

SC5314 

B3501 

M1-1G6 

ATCC 10895 

NRRL 26037 

186AS 

race 32 
ATCC 1216B 
ATCC 1216B 
ATCC 1216B 



QM9414 



HE87-1 
LS18 



IFLA/BR/67/PH8 



Avonport 

ANKA 

K1 

lineH 

RH 

Y 

LVH/75/ 
USAMRU-K/18 



Columbia 
Ceresia 
Ceresia 
Ceresia 
Maple Arrow 



Z47379 

AB007770 

U19723 

M29934 

M29935 

U81803 

U81804 

X73978 

AF008498 

U14100 

X74799 

X73529 

X17475 

J02605 

X17476 

AF016239 

K00428 

M59369 

X00779 

X01638 

U42189 

Z23012 

AF054510 



D64080 

AI755521 

D14342 

AF056101 

M92653 

U72244 

M64333 

U08844 

AJ224150 

X60488 

AJ224153 

Y11431 

L76077 

U10562 



X89227 
X89058 
Y15107 
Y15108 
X66062 
X03558 
AB008010 



fuf(EF-1 
fuf(EF-1 
tuf (EF-1 
fu/(EF-1 
tuf (EF-1 
fuf(EF-1 
fuf (EF-1 
tuf (EF-1 
fuf (EF-1 
fi/f(EF-1 
fuf* (EF-1 
ruf(EF-1 
tuf (EF-1 
ruf(EF-1 
firf(EF-1 
tuf (EF-1 
fu/(M) 
tuf (EF-G 
fuf*(EF-1 
ri/f(EF-1 
fuf(EF-1 
tuf (EF-1 
tuf (EF-1 



fu/(EF-1 
fuf* (EF-1 
fu/(EF-1 
ft/f(EF-1 
fu/(EF-1 
ftrf(EF-1 
fitf(EF-1 
tuf (EF-1 
fi/f(EF-1 
firf(EF-1 
ft/f(EF-1 
(EF-1 
fL/f (EF-1 
fuf(EF-1 



fu/(EF-1) 
fuf(EF-1) 
fuf(EF-1) 
tuf (EF-1) 
tuf (EF-1) 
fuf(EF-1) 
tuf 



Acetobacterium woodi 
55 Bacillus firmus 



DSM 1030 
OF4 



U10505 
M60117 



atpD 
atpD 
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Table 11. Microbial species for which tut and/ r atpD and/or recA sequences are available in 
public databases (continued). 



10 



15 



20 



25 
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55 



Species 


Strain 


Accession number 


Codina aene* 


Bacillus megaterium 


QM B1551 


M20255 


atpD 


Bacillus stearothermophilus 




D38058 


atpD 


Bacillus stearothermophilus 


IFO1035 


D38060 


atpD 


Bacillus subtilis 


168 


228592 


atpD 


Bacteroides fragilis 


DSM 2151 


M22247 


atpD 


Bordetella bronchiseptica 


RB50 


Genome project 2 


atpD 


Bordetella pertussis 


Tohama 1 


Genome project 2 


atpD 


Borrelia burgdorferi 


B31 


AE001122 


atpD (V) 


Burkholderia cepacia 


DSM50181 


X76877 


atpD 


Campylobacter jejuni 


NCTC 11168 


Genome project 2 


atpD 


Chlamydia pneumoniae 




Genome project 2 


atpD (V) 


Chlamydia trachomatis 


MoPn 


Genome project 2 


afpD (V) 


Chlorobium vibrioforme 


DSM 263 


X76873 


atpD 


Citrobacter freundii 


JEO503 


AF037156 


atpD 


Clostridium acetobutylicum 


ATCC 824 


Genome project 2 


atpD 


GlQStridiurr.-zestGbutflieuffi 


D5M792 


AF101055 


atpD 


Clostridium difficile 


630 


Genome project 2 


atpD 


Corynebacterium glutamicum 


ASO 19 


X76875 


atpD 


Corynebacterium glutamicum 


MJ-233 


E09634 


atpD 


Cytophaga lytica 


DSM 2039 


M22535 


atpD 


Enterobacter aerogenes 


DSM 30053 




atpD 


Enterococcus faecalis 




M90060 


atpD 


Enterococcus faecalis 


V583 


Genome project 2 


atpD (V) 


Enterococcus hirae 


ATCC 9790 


D 17462 


atpD (V) 


Escherichia coli 




J01594 


atpD 


-Escherichia~coii~ 


— " — 


M25464 


atpD 


Escherichia coli 


** 


V00267 


atpD 


Escherichia coli 




V00311 


atpD 


Escherichia coli 


K12 MG1655 


L1 0328 


atpD 


Flavobacterium ferrugineum 


DSM 13524 


3 


atpD 


Haemophilus actinomycetemcomitans 




Genome project 2 


atpD 


Haemophilus influenzae 


Rd 


U32730 


atpD 


Halobacterium salinarum 




S56356 


atpD (V) 


Haloferax volcanii 


WR 340 


X79516 


atpD 


Helicobacter pylori 


NCTC 11638 


AF004014 


atpD 


Helicobacter pylori 


26695 


Genome project 2 


atpD 


Helicobacter pylori 


J99 


Genome project 2 


atpD 


Lactobacillus casei 


DSM 20021 


X64542 


atpD 


Methanococcus jannaschii 


DSM 2661 


U67477 


atpD (V) 


Methanosarcina barker! 


DSM 800 


J04836 


atpD (V) 


Moorella thermoacetica 


ATCC 39073 


U64318 


atpD 


Mycobacterium avium 


104 


Genome project 2 


atpD 


Mycobacterium bovis 


AF21 22/97 


Genome project 2 


atpD 


Mycobacterium leprae 




U151 86 


atpD 


Mycobacterium tuberculosis 


H37Rv 


Z73419 


atpD 


Mycoplasma gallisepticum 




X64256 


atpD 


Mycoplasma genitalium 


G37 


U39725 


atpD 


Mycoplasma pneumoniae 


M129 


U43738 


atpD 


Neisseria gonorrhoeae 


FA 1090 


Genome project 2 


atpD 


Neisseria meningitidis 


Z2491 


Genome project 2 


atpD 


Pectinatus frisingensis 


DSM 20465 


X64543 


atpD 


Peptococcus niger 


DSM 20475 


X76878 


atpD 


Pirellula marina 


IF AM 1313 


X57204 


atpD 


Porphyromonas gingivalis 


W83 


Genome project 2 


atpD (V) 


Propionigenium modestum 


DSM 2376 


X58461 


atpD 



106 



CA 02307010 2000-05-19 



Table 11. Microbial sp cies for which tut and/ r atpD and/or recA sequences are available in 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



Pseudomonas aeruginosa PA01 
Pseudomonas putida 

Rhodobacter capsulatus B1 00 

Rhodospirillum rubrvm 

5 Rickettsia prowazekii F-1 2 

Rickettsia prowazekii Madrid 

Ruminococcus albus 7ATCC 

Salmonella choleraesuis S83769 

subsp. arizonae 

10 Salmonella choleraesuis u24 

subsp. arizonae 

Salmonella bongori JE041 62 

Salmonella bongori BR1 859 

Salmonella choleraesuis DS21 0/89 
15 subsp. diarizonae 

Salmonella choleraesuis JEO307 
subsp. diarizonae 

Salmonella choleraesuis S 1 0967 1 

subsp. diarizonae 

20 Salmonella choleraesuis subsp. K228 

choleraesuis serotype Dublin 1 

Salmonella choleraesuis subsp. K771 

choleraesuis serotype Dublin 

Salmonella choleraesuis S84366 
25 —subsp. houtenae 

Salmonella choleraesuis S84098 

subsp. houtenae 

Salmonella choleraesuis BR2047 
subsp. indica 

30 Salmonella choleraesuis subsp. Div36-86 
choleraesuis serotype Infantis 

Salmonella choleraesuis NSC72 
subsp. salamae 

Salmonella choleraesuis S1 1 4655 

35 subsp. salamae 

Salmonella choleraesuis subsp. Div95-86 
choleraesuis serotype Tennessee 

Salmonella choleraesuis subsp. LT2 
choleraesuis serotype Typhimurium 

40 Shewanella putida 

Shewanella putrefaciens MR-1 

Stigmatella aurantiaca Sga1 

Streptococcus bovis JB-1 

Streptococcus mutans GS-5 

45 Streptococcus mutans UAB159 

Streptococcus pneumoniae Type 4 

Streptococcus pneumoniae Type 4 
Streptococcus pyogenes 

Streptococcus pyogenes M1 -GAS 

50 Streptococcus sanguinis 10904 

Streptomyces lividans 1 326 

Thermus thermophilus HB8 

Thiobacillus ferrooxidans ATCC 33020 

Treponema pallidum Nichols 
55 Vibrio alginolyticus 



Genome project 2 
Genome project 2 
X99599 
X02499 
AF036246 
Genome project 2 
AB006151 
AF037146 

AF037147 

AF037155 
AF037154 
AF037149 

AF037148 

AF037150 

AF037140 

AF037139 

AF037151 

AF037152 

AF037153 

AF037142 

AF037144 

AF037145 

AF037143 

AF037141 

Genome project 2 
Genome project 2 
X76879 
AB009314 
U31170 

Genome project 2 

Genome project 2 

Genome project 2 

Genome project 2 

Genome project 2 

AF001955 

Z22606 

D63799 

M81087 

AE001228 

X16050 



atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 
atpD 

atpD 

atpD 
atpD 
atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD 

atpD (V) 

atpD 

atpD (V) 

atpD 

atpD 

atpD 

atpD(V) 

atpD 

afpD(V) 

atpD 
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Table 11. Micr blal species for which tuf and/or atpD and/or recA sequences are available In 
public databases (c ntinued). 



Species 



10 



15 



20 



Vibrio cholerae 
Wolinella succinogenes 
Yersinia enterocolitica 
Yersinia pestis 

Fungi 

Candida tropicalis 
Kluyveromyces lactis 
Neurospora crassa 
Saccharomyces cerevisiae 
Saccharomyces cerevisiae 
Schizosaccharomyces pombe 
Schizosaccharomyces pombe 

Parasites 

Giardia iambiia 
Plasmodium falciparum 
Trypanosoma congolense 

Human and plants 



Homo sapiens 
25 — Homosapiens 

Bacteria 

Acetobacter aceti 

30 Acetobacter altoacetigenes 
Acetobacter poiyoxogenes 
Acholeplasma faidlawii 
Acidiphilium facilis 
Acidothermus cellulolyticus 

35 Acinetobacter calcoaceticus 
Aeromonas salmonicida 
Agrobacterium tumefaciens 
Allochromatium vinosum 
Aquifex aeoiicus 

40 Aquifex pyrophilus 

Azotobacter vinelandii 
Baciilus stearothermophilus 
Bacillus subtilis 
Bacillus subtilis 

45 Bacteroides fragilis 
Bifidobacterium breve 
Blastochloris viridis 
Bordetella pertussis 
Borrelia burgdorferi 

50 Borrelia burgdorferi 

Brevibacterium flavum 
Brucella abortus 
Burkholderia cepacia 
Campylobacter jejuni 

55 Chlamydia trachomatis 



Strain 



Accession number 



Coding gene* 



N16961 
DSM 1470 
NCTC 10460 
CO-92 



2359/152 



X2180-1A 
972 h- 
972 h- 



WB 
3D7 
IL3000 



Genome project 2 
X76880 
AF037157 
Genome project 2 



M64984 

U37764 

X53720 

M12082 

J05409 

S47814 

M57956 



U18938 
L08200 
Z25814 



L09234 
M27132 



no. 1023 


S60630 


MH-24 


E05290 


NBI 1028 


D13183 


8195 


M81465 


ATCC 35904 


D16538 


ATCC 43068 


AJ 006705 


BD413/ADP1 


L26100 


A449 


U83688 


C58 


L07902 




AJ 000677 


VF5 


AE000775 


Kol5a 


L23135 




S96898 


10 


Genome projecr 


PB1831 


U87792 


168 


299112 




M63029 


NCFB 2258 


AF094756 


DSM 133 


AF022175 


165 


X53457 


Sh-2-82 


U23457 


B31 


AE001124 


MJ-233 


E10390 


2308 


L00679 


ATCC 17616 


U70431 


81-176 


U03121 


L2 


U 16739 



atpD 
atpD 
atpD 
atpD 



atpD (V) 

atpD 

atpD 

atpD 

atpD (V) 

atpD (V) 

atpD 



atpD 
atpD (V) 
atpD (V) 



afpD(V) 
atpD 



recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (c ntinu d). 



Species 



Strain 



Accession number 



Coding gene* 



Chloroftexus aurantiacus 
Clostridium acetobutylicum 
Clostridium perfringens 
Corynebacterium glutamicum 
5 Corynebacterium pseudotuberculosis 
Deinococcus radiodurans 
Deinococcus radiodurans 
Enterobacter aggtomerans 
Enterococcus faecalis 
1 0 Erwinla carotovora 
Escherichia coli 
Escherichia coli 
Escherichia coli 
Frankia alni 
1 5 Gluconobacter oxydans 
Haemophilus influenzae 
Haemophilus influenzae 
Haemophilus influenzae 
Helicobacter pylori 
20 Helicobacter pylori 
Helicobacter pylori 
Lactococcus lactis 
Legionella pneumophila 
Leptospira biflexa 
25 Leptospira interrogans 

Magnetospirillum magnetotacticum 
Methylobacillus flagellatus 
Methylomonas clara 
Mycobacterium leprae 
30 Mycobacterium tuberculosis 

Mycoplasma genltalium 

Mycoplasma mycoides 

Mycoplasma pneumoniae 

Mycoplasma pulmonis 
3 5 Myxococcus xanthus 

Neisseria animatis 

Neisseria cinerea 

Neisseria cinerea 

Neisseria cinerea 
40 Neisseria cinerea 

Neisseria elongata 

Neisseria elongata 

Neisseria elongata 

subsp. intermedia 
45 Neisseria elongata 

Neisseria elongata 

Neisseria elongata 

Neisseria flava 

Neisseria fiavescens 
50 Neisseria gonorrhoeae 

Neisseria gonorrhoeae 

Neisseria gonorrhoeae 

Neisseria lactamica 

Neisseria lactamica 
55 Neisseria lactamica 



.1-1 O-fl 


AF037259 


recA 




M94057 


recA 


13 


U61497 


recA 


nOu i w 1 


U14965 


recA 


H2^1 


U30387 


recA 


IxUOOU 1 


AB005471 


recA 


R1 
n i 


U01876 


recA 




L03291 


recA 


UUIA 


M81466 


recA 




X55554 


recA 




J01672 


recA 




X55552 


recA 


VC-A. 9 


AE000354 


recA 


r\\ IO 


AJ006707 


recA 




U21001 


recA 


Rrl 


U32687 


recA 


nu 


U32741 


recA 


RH 


L07529 


recA 


CO A 


Z35478 


recA 




AE000536 


recA 


IQQ 


AP001 453 


recA 


Kill O 

MLo 


IVIOO 1 UU 


recA 




X55453 


recA 


serovar paiuu 


U32625 


recA 


serovar porn una 


U29169 


recA 


MC 1 

IVlO" I 


X17371 


recA 


yri/1 
JVJrlNI 




recA 


ATpp ^199fi 


X59514 


recA 




X73822 


recA 


H37RV 


X58485 


recA 


G37 


U39717 


recA 




L22073 


recA 




MPAE000033 


recA 


IS. LI/ JO 


I 99074 


recA 




L40368 


recA 


MpTp 10919 


U57910 


recA 




AJ223869 


recA 


I MP 1RAR 


U57906 

Ww' W WW 


recA 


MPTP 1H9QA 
INO 1 O 1 Utw 1 ** 


AJ223871 


recA 


VoUIUO IVtOw 1 1 


AJ223870 


recA 


ppl in 91^1 


AJ223882 


recA 


CCUG 4165A 


AJ223880 


recA 


CCUG 4557 


AJ223879 


recA 


NCTC 10660 


AJ223881 


recA 


NCTC 11050 


AJ223878 


recA 


NHITCC 2376 


AJ223877 


recA 


Bangor 9 


AJ223873 


recA 


LNP444 


U57907 


recA 


CH95 


U57902 


recA 


FA19 


X64842 


recA 


MS11 


X17374 


recA 


CCUC 7757 


AJ223866 


recA 


CCUG 7852 


Y11819 


recA 


LCDC 77-143 


Y11818 


recA 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available in 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



Neisseria lactamica 
Neisseria lactamica 
Neisseria lactamica 
Neisseria lactamica 
5 Neisseria meningitidis 
Neisseria meningitidis 
Neisseria meningitidis 
Neisseria meningitidis 
Neisseria meningitidis 
10 Neisseria meningitidis 
Neisseria meningitidis 
Neisseria meningitidis 
Neisseria meningitidis 
Neisseria meningitidis 
15 Neisseria mucosa 
Neisseria mucosa 
Neisseria perfiava 
Neisseria perfiava 
Neisseria pharyngis var. flava 
20 Neisseria polysaccharea 
Neisseria polysaccharea 
Neisseria polysaccharea 
Neisseria polysaccharea 
Neisseria polysaccharea 
25 Neisseria sicca 

Neisseria subflava 
Paracoccus denitrificans 
Pasteurella multocida 
Porphyromonas gingivalis 
30 Prevotella ruminicota 
Proteus mirabilis 
Proteus vulgaris 
Pseudomonas aeruginosa 
Pseudomonas aeruginosa 
3 5 Pseudomonas aeruginosa 
Pseudomonas cepacia 
Pseudomonas fluorescens 
Pseudomonas putida 
Pseudomonas putida 
40 Rhizoblum leguminosarum 
biovar viciae 
Rhizobium phaseoli 
Rhodobacter capsulatus 
Rhodobacter sphaeroides 
45 Rhodopseudomonas palustris 
Rickettsia prowazekii 
Rickettsia prowazekii 
Serratia marcescens 
Shigella flexneri 
50 Shigella sonnei 

Sinorhizobium meliloti 
Staphylococcus aureus 
Streptococcus gordonii 
Streptococcus mutans 
55 Streptococcus pneumoniae 



LCDC 80-111 

LCDC 845 

NCTC 10617 

NCTC 10618 

44/46 

Bangor 13 

HF116 

HF130 

HF46 

M470 

N94II 

NCTC 8249 
P63 
S3446 
LNP 405 
VedrosM1801 
CCUG 17915 
LCDC 85402 
NCTC 4590 
CCUG 18031 
CCUG 24845 
CCUG 24846 
INS MA 3008 
NCTC 11858 
NRL30016 
JnIBL 30017 
DSM 413 

W83 

JCM 8958 
pG1300 



PAM7 
PA012 

OE28.3 

PpS145 
VF39 

CNPAF512 
J50 
2.4.1 
N7 

Madrid E 
Madrid E 



KNIH104S 
2011 

Challis V288 
UA96 



AJ223864 

AJ223865 

U57905 

AJ223863 

X64849 

AJ223868 

X64848 

X64844 

X64847 

X64850 

X64846 

AJ223867 

X64845 

U57903 

U57908 

AJ223875 

AJ223876 

AJ223862 

U57909 

Y11815 

Y11816 

Y11814 

Y11817 

U57904 

AJ223872 

AJ223874 

U59631 

X99324 

U70054 

U61227 

X14870 

X55555 

X05691 

X52261 

D13090 

D90120 

M96558 

L12684 

U70864 

X59956 

X62479 

X82183 

X72705 

D84467 

AJ235273 

U01959 

M22935 

X55553 

AF101227 

X59957 

L25893 

L20574 

M81468 

Z17307 



recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 

recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
recA 
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Table 11. Microbial species for which tuf and/or atpD and/or recA sequences are available In 
public databases (continued). 



Species 



Strain 



Accession number 



Coding gene* 



Streptococcus pneumoniae 

Streptococcus pyogenes 

Streptococcus salivarius 

subsp. thermophilus 
5 Streptomyces ambofaciens 

Streptomyces coelicolor 

Streptomyces lividans 

Streptomyces rimosus 

Streptomyces venezuelae 
10 Synechococcus sp. 

Thermotoga mahtima 

Thermus aquaticus 

Thermus thermophilus 

Thiobaciiius ferrooxidans 
15 Treponema pallidum 

Vibrio anguiilarum 

Vibrio cholerae 

Vibrio cholerae 

Vibrio cholerae 
20 Vibrio cholerae 

Vibrio cholerae 

Vibrio cholerae 

Vibrio cholerae 

Vibrio cholerae 
25 Vibrio cholerae 

Xanthomonas citri 

Xanthomonas oryzae 

Xenorhabdus bovienii 

Xenorhabdus nematophilus 
30 Yersinia pestis 

Fungi, parasites, human and plants 

Anabaena variabilis 
35 Arabidopsis thaliana 

Candida albicans 

Coprinus cinereus 

Emericella nidufans 

Gaflus gallus 
40 Homo sapiens 

Homo sapiens 

Leishmania major 

Leishmania major 

Mus muscuius 
45 Neurospora crassa 

Saccharomyces cerevisiae 

Schizosaccharomyces pombe 

Schizosaccharomyces pombe 

Tetrahymena thermophila 
50 Trypanosoma brucei 

Ustilago maydis 

Xenopus laevis 

Xenopus laevis 



nouu 


Z34303 


recA 


N7131 

l 1 vJ 1 


U21934 


recA 




M94062 


recA 


DSM 40697 


Z30324 


recA 




AL020958 


recA 


TKP4 


X76076 


recA 


Rfi 
no 


X94233 


recA 




U04837 


recA 


PRC 

r no 


M29495 


recA 


L23425 


recA 




L20095 


recA 




D 17392 


recA 




M26933 


recA 




AE001 243 


recA 




M80525 


recA 


017 


X71969 


recA 


2740-80 


U10162 


recA 


569B 


L42384 


recA 


M549 


AF1 17881 


recA 


M553 


AF1 17882 


recA 


M645 


AF 117883 


recA 


M793 


AF1 17878 


recA 


M794 


AF1 17880 


recA 


M967 


AF1 17879 


recA 


-.-XW47 


AF006590 


recA 




AF013600 


recA 


T228/1 


U87924 


recA 


AN6 


AF1 27333 


recA 


231 


X75336 


recA 



ATCC 29413 
Okayarna-7 



Friediin 
Friedlin 

74-OR23-1A 



972h- 
PB9R 
stock 427 



M29680 

U43652 

U39808 

U21905 

Z80341 

L09655 

D13804 

D63882 

AF062379 

AF062380 

D58419 

D29638 

D10023 

Z22691 

AL021817 

AF064516 

Y13144 

U62484 

D38488 

D38489 



recA 

recA (Rad5i; 
recA (Dmci; 
recA (Rad5i; 
recA (RadSi; 
recA (Rad5l) 
recA (RadSi; 
recA (Dmci; 
recA (Rad51 ; 
recA (DmdJ 
recA (Dmc1^ 
recA (Rad51 ) 
recA (RadSi; 
recA (RadSi; 
recA (Drndj 
recA (RadSi; 
recA (Rad51> 
recA (Rad51 ; 
recA (Rad51 ; 
recA (Rad5i; 
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Tab! 11. Micr bial species f r which tuf and/or atpD and/or recA sequences are available In 
public databases (continued). 

Species Strain Accession number Coding gene* 

* divergent from main (usually A and B) copies of the elongation factor-Tu 

tuf EF-1) indicates tuf sequences of the eukaryotic type (elongation factor 1a) 
tuf (M) indicates resequences from organellar (mostly mitochondrial) origin 
5 atpD indicates atpD sequences ot the F-type 

atpD (V) indicates atpD sequences of the V-Type 
recA Indicates recA sequences 
recA (Rad51 ) indicates rad51 sequences or homologs 
recA (Drnd ) indicates dmd sequences or homologs 
10 1 Nucleotides sequences published in Arch. Microbiol. 1990 153:241-247 

2 These sequences are from theTIGR database (http^/vwwJigr.orgAdb/tdb .ntml) 

3 Nucleotides sequences published in FEMS Microbiology Letters 1988 50:101-106 
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Table 12. Bacterial species used to test the specificity of the Staphylococcus genus-specific 
amplification primers derived from tut sequences. 



Strain 



Reference number 



Strain 



Reference number 



Staphylococci species (n=27) 

Staphylococcus arlettae ATCC 43957 

Staphylococcus aureus ATCC 35844 
subsp. anaerobius 

10 Staphylococcus aureus ATCC 43300 
subsp. aureus 

Staphylococcus auricularis ATCC 33753 

Staphylococcus capitis ATCC 27840 
subsp. capitis 

15 Staphylococcus caprae ATCC 35538 

Staphylococcus carnosus ATCC 51365 

Staphylococcus chromogenes ATCC 43764 

Staphylococcus cohnii DSM 20260 
subsp. urealyticum 

20 Staphylococcus delphini ATCC 49 1 71 

Staphylococcus epidermidis ATCC 1 4990 

Staphylococcus equorum ATCC 43958 

Staphylococcus felis ATCC 491 68 

Staphylococcus gallinarum ATCC 35539 

25 Staphylococcus haemolyticus ATCC 29970 

Staphylococcus hominis ATCC 27844 

Staphylococcus hyicus ATCC 1 1 249 

Staphylococcus intermedius ATCC 29663 

Staphylococcus kloosis ATCC 43959 

30 Staphylococcus lentus ATCC 29070 

Staphylococcus lugdunensis ATCC 43809 

Staphylococcus saprophytics ATCC 15305 

Staphylococcus schleiferi ATCC 49545 
subsp. coagulans 

35 Staphylococcus sciuri ATCC 29060 
subsp. sciuri 

Staphylococcus simulans ATCC 27848 

Staphylococcus warneri ATCC 27836 

Staphylococcus xylosus ATCC 29971 

40 Gram-negative bacteria (n=33) 

Acinetobacter baumannii ATCC 1 9606 

Bacteroides distasonis ATCC 8503 

Bacteroides fragilis ATCC 25285 

Bulkholderia cepacia ATCC 2541 6 

45 Bordetella pertussis ATCC 9797 

Citrobacter freundii ATCC 8090 

Enterobacter aerogenes ATCC 13048 

Enterobacter cloacae ATCC 1 3047 

Escherichia coll ATCC 25922 

50 Haemophilus influenzae ATCC 8907 
Haemophilus parahaemolyticus ATCC 1 001 4 

Haemophilus parainfluenzae ATCC 7901 

Hafnia alvei ATCC 1 3337 

Kingella indologenes ATCC 25869 

55 Klebsiella oxytoca ATCC 1 31 82 

Klebsiella pneumoniae ATCC 1 3883 

Moraxella catarrhalis ATCC 25240 



Other Gram-positive bacteria (n=20) 

Bacillus subtilis 
Enterococcus avium 



Enterococcus durans 

Enterococcus faecalis 
Enterococcus faecium 

Enterococcus flavescens 
Enterococcus gallinarum 
Lactobacillus acidophilus 
Lactococcus lactis 

Listeria innocua 
Listeria ivanovii 
Listeria monocytogenes 
Macrococcus caseolyticus 
Streptococcus agalactiae 
Streptococcus anginosus 
Streptococcus bovis 
Streptococcus mutans 
Streptococcus pneumoniae 
Streptococcus pyogenes 
Streptococcus salivarius 



ATCC 27370 
ATCC 14025 

ATCC 19432 

ATCC 19433 
ATCC 19434 

ATCC 49996 
ATCC 49573 
ATCC 4356 
ATCC 11454 

ATCC 33090 
ATCC 19119 
ATCC 15313 
ATCC 13548 
ATCC 13813 
ATCC 33397 
ATCC 33317 
ATCC 25175 

ATCC 6303 
ATCC 19615 

ATCC 7073 



Morganella morganli ATCC 25830 

Neisseria gonorrhoeae ATCC 35201 

Neisseria meningitidis ATCC 1 3077 

Proteus mirabilis ATCC 25933 

Proteus vulgaris ATCC 1 331 5 

Providencia rettgeri ATCC 9250 

Providencia stuartii ATCC 2991 4 

Pseudomonas aeruginosa ATCC 27853 

Pseudomonas fluorencens ATCC 13525 

Salmonella choleraesuis ATCC 7001 

Salmonella typhimurium ATCC 14028 

Serratia marcescens ATCC 81 00 

Shigella flexneri ATCC 1 2022 

Shigella sonnei ATCC 29930 
Stenotrophomonas maltophilia ATCC 1 3843 

Yersinia enterocolitica ATCC 961 0 
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Table 13. 



Bacterial species used to test the specificity f the penicillin-resistant Streptococcus 
pneumoniae amplification primers derived fr m pbpla sequences. 



Strain 



Reference number 



Strain 



Reference number 



Gram-positive species (n=67) 

Abiotrophia adiacens 
Abiotrophia defectiva 
Actinomyces pyogenes 
Bacillus anthracis 
Bacillus cereus 
Bifidobacterium breve 
Clostridium difficile 
Enterococcus avium 
Enterococcus casseliflavus 
Enterococcus dispar 
Enterococcus durans 
Enterococcus faecalis 
Enterococcus faecium 
Enterococcus flavescens 
Enterococcus gallinarum 
Enterococcus hfrae 
Enterococcus mundtii 
Enterococcus raffinosus 
Lactobacillus lactis 
Lactobacillus monocytogenes 
Mobiluncus curtisii 
Peptococcus niger 
Peptostreptococcus acones 
Peptostreptococcus anaerobius 
~ peptostreptococcus 
asaccharolyticus 
Peptostreptococcus lactolyticus 
Peptostreptococcus magnus 
Peptostreptococcus prevotii 
Peptostreptococcus tetradius 
Staphylococcus aureus 
Staphylococcus capitis 
Staphylococcus epidermidis 
Staphylococcus haemolyticus 



ATCC 49 1 75 Staphylococcus hominis 

ATCC 491 76 Staphylococcus lugdunensis 

ATCC 1 941 1 Staphylococcus saprophytics 

ATCC 4229 Staphylococcus simulans 

ATCC 14579 Staphylococcus, warned 

ATCC 1 5700 Streptococcus acidominimus 

ATCC 9689 Streptococcus agalactiae 

ATCC 1 4025 Streptococcus anginosus 

ATCC 25788 Streptococcus bovis 

ATCC 51 266 Streptococcus constellatus 

ATCC 1 9432 Streptococcus cricetus 

ATCC 2921 2 Streptococcus cristatus 

ATCC 1 9434 Streptococcus downei 

ATCC 49996 Streptococcus dysgalactiae 

ATCC 49573 Streptococcus equi 

ATCC 8043 Streptococcus ferus 

ATCC 431 86 Streptococcus gordonii 

ATCC 49427 Streptococcus intermedius 

ATCC 1 9435 Streptococcus mitis 

ATCC 1 53 1 3 Streptococcus mitis 

ATCC 35242 Streptococcus mitis 

ATCC 27731 Streptococcus mutans 

ATCC 691 9 Streptococcus oralis 

ATCC-27337 Streptococcus oralis 

ATCC 2639 Streptococcus oralis 

Streptococcus parasanguinis 

ATCC 51 1 72 Streptococcus parauberis 

ATCC 1 5794 Streptococcus rattus 

ATCC 9321 Streptococcus sal'rvarius 

ATCC 35098 Streptococcus sanguinis 

ATCC 25923 Streptococcus suis 

ATCC 27840 Streptococcus uberis 

ATCC 14990 Streptococcus vestibularis 
ATCC 29970 



Gram-negative species (n=33) 

Actinetobacter baumannii 
Bordetella pertussis 
Citrobacter diversus 
Citrobacter freundfi 
Enterobacter aerogenes 
Enterobacter agglomerans 
Enterobacter cloacae 
Escherichia coli 
Haemophilus ducreyi 
Haemophilus haemolyticus 
Haemophilus influenzae 
Haemophilus parainftuenzae 
Hafnia alvei 
Klebsiella oxytoca 
Klebsiella pneumoniae 
Moraxella atlantae 
Moraxella catarrhalis 



ATCC 1 9606 Moraxella morganii 

ATCC 9797 Neisseria gonorrhoeae 

ATCC 27028 Neisseria meningitidis 

ATCC 8090 Proteus mirabilis 

ATCC 1 3048 Proteus vulgaris 

ATCC 271 55 Providencia alcalifaciens 

ATCC 13047 Providencia rettgeri 

ATCC 25922 Providencia rustigianii 

ATCC 33940 Providencia stuartii 

ATCC 33390 Pseudomonas aeruginosa 

ATCC 9007 Pseudomonas fluorescens 

ATCC 7901 Pseudomonas stutzeri 

ATCC 13337 Salmonella typhimurium 

ATCC 1 31 82 Serratia marcescens 

ATCC 1 3883 Shigella flexneri 

ATCC 29525 Yersina enterocolitica 
ATCC 43628 



ATCC 27844 
ATCC 43809 
ATCC 15305 
ATCC 27848 
ATCC 27836 
ATCC 51726 
ATCC 12403 
ATCC 33397 
ATCC 33317 
ATCC 27823 
ATCC 19624 
ATCC 51100 
ATCC 33748 
ATCC 43078 
ATCC 9528 
ATCC 33477 
ATCC 10558 
ATCC 27335 
ATCC 903 
LSPQ 2583 
ATCC 49456 
ATCC 27175 
ATCC 10557 
ATCC 9811 
ATCC 35037 
ATCC 15912 
ATCC 6631 
ATCC 15912 
ATCC 7073 
ATCC10556 
ATCC 43765 
ATCC 19436 
ATCC 49124 



ATCC 13077 
ATCC 35201 
ATCC 13077 
ATCC 25933 
ATCC 13315 
ATCC 9886 
ATCC 9250 
ATCC 33673 
ATCC 33672 
ATCC 35554 
ATCC 13525 
ATCC 17588 
ATCC 14028 
ATCC 13880 
ATCC 12022 
ATCC 9610 
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Table 14 Bact rial species <n=104) del cted by the platelet contaminants assay. Bold characters 
indicate the major bacterial contaminants found in platelet concentrates. 



5 Abiotrophia adiacens 
Abiotrophia defectiva 
Acinetobacter baumannii 
Acinetobacter Iwoffi 
Aerococcus viridans 

10 Bacillus anthracis 
Bacillus cereus 
Bacillus subtills 
Brucella abortus 
Burkholderia cepacia 

1 5 Citrobacter diversus 
Citrobacter freundii 
Enterobacter aerogenes 
Enterobacter agglomerans 
Enterobacter cloacae 

20 Enterococcus avium 

Enterococcus casseliflavus, 
Enterococcus dispar 
Enterococcus durans 
Enterococcus faecalis 

25 Enterococcus faecium 
Enterococcus flavescens 
Enterococcus gallinarum 
Enterococcus mundtii 
Enterococcus raffinosus 

30 Enterococcus solitarius 
Escherichia coll 
Gemella morbillorum 
Haemophilus ducreyi 
Haemophilus haemolyticus 

35 Haemophilus influenzae 
Haemophilus 
parahaemolyticus 
Haemophilus parainfluenzae 
Hafnia alvei 

40 Kingella kingae 



Klebsiella oxytoca 
Klebsiella pneumoniae 

Legionella pneumophila 
Megamonas hypermegale 
45 Moraxella atlantae 
Moraxella catarrhalis 
Morganella morganii 
Neisseria gonorrheae 
Neisseria meningitidis 

50 Pasteurella aerogenes 
Pasteurella multocida 
Peptostreptococcus magnus 
Proteus mirabilis 
Providencia alcalifaciens 

55 Providencia rettgeri 
Providencia rustigianii 
Providencia stuartii 
Pseudomonas aeruginosa 
Pseudomonas fluorescens 

60 Pseudomonas stutzeri 
Salmonella bongori 
Salmonella choleraesuls 
Salmonella enteritidis 
Salmonella gallinarum 

65 Salmonella typhifaurium 
Serratia liquefaciens 
Serratla marcescens 
Shigella flexneri 
Shigella sonnei 

70 Staphylococcus aureus 
Staphylococcus capitis 
Staphylococcus epldermldls 
Staphylococcus haemolyticus 
Staphylococcus hominis 

75 Staphylococcus lugdunensis 
Staphylococcus saprophytics 



Staphylococcus simulans 
Staphylococcus warneri 
Stenotrophomonas mattophilia 

80 Streptococcus acidominimus 
Streptococcus agalactiae 
Streptococcus anginosus 
Streptococcus bovis 
Streptococcus constellatus 

85 Streptococcus cricetus 
Streptococcus cristatus 
Streptococcus dysgalactiae 
Streptococcus equi 
Streptococcus ferus 

90 Streptococcus gordonii 
Streptococcus intermedius 
Streptococcus macacae 
Streptococcus mitis 
Streptococcus mutans 

95 Streptococcus oralis 

Streptococcus parasanguinis 
Streptococcus parauberis 
Streptococcus pneumoniae 
Streptococcus pyogenes 

100 Streptococcus ratti 

Streptococcus salivarius 
Streptococcus sanguinis 
Streptococcus sobrinus 
Streptococcus uberis 

105 Streptococcus vestibularis 
Vibrio cholerae 
Yersinia enterocolitica 
Yersinia pestis 

Yersinia pseudotuberculosis 
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Abiotrophia adiacens {Streptococcus 
adjacens) 

Abiotrophia defective {Streptococcus 
defecilvus) 
5 Achromobacter species 

Acidaminococcus fermentans 
Acinetobacter alcaiigenes 
Acinetobacter anitratus 
Acinetobacter baumannil 
1 0 Acinetobacter catcoaceticus 

Acinetobacter calcoacetlcus blovar 
anitratus 

Acinetobacter caicoaceticus biovar 
Iwoffi 

1 5 Acinetobacter genomospecies 

Acinetobacter haemotytlcus 

Acinetobacter johnsonii 

Acinetobacter junii 

Acinetobacter twoffii 
20 Acinetobacter radioresistens 

Acinetobacter species 

Actinobaciiius actinomycetemcomitans 

Actinobaclitus capsutatus 

Actinobaciiius equufl 
25 A ctinobaciiiu s hominis 

Actinobaciiius lignieresii 

Actinobaciiius pleuropneumoniae 

Actinobaciiius species 

Actinobaciiius suis 
30 Actinobaciiius ureae 

Actinomyces bovis 

Actinomyces Israelii 

Actinomyces meyeri 

Actinomyces naeslundii 
3 5 Actinomyces neuii subsp. anitratus 

Actinomyces neuii subsp. neuii 

Actinomyces odontoiyticus 

Actinomyces pyogenes 

Actinomyces radingae 
40 Actinomyces species 

Actinomyces tuhcensis 

Actinomyces viscosus 

Aerococcus species 

Aerococcus viridans 
45 Aeromonas cavtae 

Aeromonas hydrophila 

Aeromonas hydrophila group 
Aeromonas jandael 
Aeromonas salmonicida 
50 Aeromonas salmonicida subsp. 
achromogenes 

Aeromonas salmonicida subsp. 
masoucida 

Aeromonas salmonicida subsp. 
55 salmonicida 

Aeromonas schubertii 

Aeromonas sobrla 

Aeromonas species 

Aeromonas trota 
60 Aeromonas veronii 

Aeromonas veronii biovar sobria 

Aeromonas veronii biovar veronii 

Agrobacterium radiobacter 

Agrobacterium species 
65 Agrobacterium tumefaclens 

Alcaiigenes denitrificans 

Alcaiigenes faecalis 

Alcaiigenes odorans 

Alcaiigenes odorans (Alcaiigenes 
70 faecalis) 

Alcaiigenes species 

Alcaiigenes xytosoxidans 

Alcaiigenes xy/osoxldans subsp. 

denitrificans 



7 5 Alcaiigenes xytosoxidans subsp. 

xytosoxidans 
Alloiococcus otitis 

Anaeroblospirillum succiniciproducens 
Anaerovibrio Hpotytica 
80 Arachnia propionica 

Arcanobacterium {Actinomyces) 
bemardiae 

Arcanobacterium {Actinomyces) 
pyogenes 

8 5 Arcanobacterium haemolyticum 

Arcobacter cryaerophitus 
(Campylobacter cryaerophila) 
Arthrobacter globiformis 
Arthrobacter species 
90 Arxiozyma telturis (Torulopsis 
pintolopesll) 

Atopoblum mlnutum (Lactobacillus 
minutus) 

Aureobacterium species 
95 Bacillus amylollquefaciens 
Bacillus anthracis 
Bacillus badius 
Bacillus cereus 
Bacillus circuians 
100 Bacillus coaguians 
Bacillus firmus 
Bacillus lentus 
Bacillus licheniformis 
Bacillus megaterium 
105 Bacillus mycoides 

Bacillus pantothentlcus 
Bacillus pumilus 
Bacillus species 
Bacillus sphaericus 
110 Bacillus stearothermophilus 
Bacillus subtllls 
Bacillus thuringiensls 
Bacteroides caccae 
Bacteroides caplllosus 
115 Bacteroides distasonis 
Bacteroides eggerthii 
Bacteroides fragftis 
Bacteroides merdae 
Bacteroides ovatus 
120 Bacteroides species 

Bacteroides splanchnicus 
Bacteroides stercoris 
Bacteroides thetaiotaomlcron 
Bacteroides uniformis 
1 25 Bacteroides ureolyticus (B. corrodem 
Bacteroides vulgatus 
Bergeyella (Weeksetla) zooheicum 
Bifidobacterium adolescentis 
Bifidobacterium bifidum 
1 30 Bifidobacterium breve 
Bifidobacterium dentium 
Bifidobacterium infantis 
Bifidobacterium species 
Blastoschizomyces (Dipodascus) 
135 capitatus 

Bordetella avium 
Bordetella bronchiseptica 
Bordetella parapertussis 
Bordetella pertussis 
140 Bordetella species 
Borrella species 

Branhamella (Moraxella) catarrhaiis 
Branhamella species 
Brevibacillus brevis 
145 Brevibacillus iaterosporus 
Brevibacterium easel 
Brevibacterium epidermidis 
Brevibacterium linens 



Brevibacterium species 
1 50 Brevundimonas (Pseudomonas) 
diminuta 

Brevundimonas (Pseudomonas) 
vesicularis 

Brevundimonas species 
155 Brochothrix thermosphacta 

Brucella abortus 

Brucella canis 

Brucella melitensis 

Brucella ovis 
1 60 Brucella species 

Brucella suis 

Budvicia aquatica 

Burkholderia (Pseudomonas) cepacia 
Burkholderia (Pseudomonas) gladioli 
1 65 Burkholderia (Pseudomonas) mallei 
Burkholderia (Pseudomonas) 
pseudomallei 
Burkholderia species 
Buttiauxella agrestls 
1 70 Campylobacter coli 

Campylobacter concisus 

Campylobacter fetus 

Campylobacter fetus subsp. fetus 

Campylobacter fetus subsp. 
175 venerealis 

Campylobacter hyointestinalis 

Campylobacter jejuni subsp. doylei 

Campylobacter jejuni subsp. jejuni 

Campylobacter lari 
1 80 Campylobacter lari subsp. UPTC 

Campylobacter mucosalis 

Campylobacter species 

Campylobacter sputorum 

Campylobacter sputorum subsp. 
185 bubutus 

Campylobacter sputorum subsp. 

fecalis 

Campylobacter sputorum subsp. 
sputorum 
1 90 Campylobacter upsaliensis 

Candida (Clavispora) lusltaniae 
Candida (Pichia) guHliermondii 
Candida (Torulopsis) giabrata 
Candida albicans 
195 Candida boidinii 

Candida catenulata 
Candida cifenii 
Candida cofliculosa 
Candida conglobata 
200 Candida curvata (Cryptococcus 
curvatus) 
Candida dattiia 
Candida dubliniensis 
Candida famata 
205 Candida gtobosa 
Candida hellenica 
Candida holmli 
Candida humicola 

Candida inconspicua 
210 Candida intermedia 

Candida kefyr 

Candida krusel 

Candida lambica 

Candida magnotiae 
215 Candida maris 

Candida melibiosica 

Candida membrane efa ciens 

Candida norvegensis 

Candida norvegica 
220 Candida parapsilosis 

Candida paratropicatis 

Candida pelliculosa 
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Candida pseudotropicaUs 
Candida pulcherrima 
Candida ravautii 
Candida rugosa 
5 Candida sake 
Candida sitvicola 
Candida species 
Candida sphaerica 
Candida staliatoidea 
10 Candida tenuis 
Candida troplcalls 
Candida utifis 
Candida valida 
Candida vini 
15 Candida viswanathii 
Candida zeyianoides 
Capnocytophaga gingtvalls 
Capnocytophaga ochracea 
Capnocytophaga species 
20 Capnocytophaga sputigena 
Cardiobacterium hominis 
Camobacterium dtvergens 
Camobacterium piscicola 
CDC group ED-2 
25 CDC group EF4 {Pasteureiia sp.) 
CDC group EF-4A 
CDC group EF-4B 
CDC group EQ-Z 
CDC group HB-5 
30 CDC group II K-2 

CDC group IV C-2 {Bordetella-\toe) 
CDC group M5 
COC group M6 
Cedecea davisae 
3 5 Cedecea lapagei 
Cedecea neteri 
Cedecea species 

. Ceilutomonas {Oerskovia) turbata 
Celtulomonas species 
40 Chlamydia species 

Chromobacterium violaceum 
Chryseobacterium (Flavobacterium) 
indoiogenes 

Chryseobacterium {Flavobacterium) 
45 meningosepticum 

Chryseobacterium gleum 

Chryseobacterium species 

Chryseomonas indoiogenes 

Citeromyces matritensis 
50 Cifrobacter amaionaticus 

Citrobacter braakii 

Citrobacter diversus 

Citrobacter farmeri 

Citrobacter freundii 
5 5 Citrobacter freundii complex 

Citrobacter koseri 

Citrobacter sedtakii 

Citrobacter species 

Citrobacter werkmanii 
60 Citrobacter youngae 

Clostridium acetobutylicum 

Clostridium barati 

Clostridium beljerinckll 

Clostridium bifermentans 
65 Clostridium botulinum 

Clostridium botunnum {HP) B&F 

Clostridium botulinum (IMP) E 

Clostridium botulinum (P) A&H 

Clostridium botulinum (P) F 
70 Clostridium botulinum G 1 

Clostridium botulinum G2 

Clostridium butyricum 

Clostridium cadaveris 

Clostridium chauvoei 
7 5 Clostridium clostridiiforme 

Clostridium difficile 

Clostridium fallax 

Clostridium glycoticum 



Clostridium hastiforme 
80 Clostridium histotyticum 
Clostridium innocuum 
Clostridium limosum 
Clostridium novyi 
Clostridium novyi A 
85 Clostridium paraputrificum 
Clostridium periringens 
Clostridium putrificum 
Clostridium ramosum 
Clostridium septlcum 
90 Clostridium sordellii 
Clostridium species 
Clostridium sphenoldes 
Clostridium sporogenes 
Clostridium subterminale 
95 Clostridium tertium 
Clostridium tetani 
Clostridium tyrobutyricum 

Comamonas {Pseudomonas) 
acidovorans 
1 00 Comamonas (Pseudomonas) 

testosteronl 

Comamonas species 

Corynebacterium accolens 

Corynebacterium afermentans 
1 05 Corynebacterium amycola turn 

Corynebacterium aquaticum 

Corynebacterium argentoratense 

Corynebacterium auris 

Corynebacterium bovis 
110 Corynebacterium coyleae 

Corynebacterium cystitidis 

Corynebacterium diphtheriae 

Corynebacterium diphtheriae biotype 

betfantl 

115 Corynebacterium diphtheriae biotype 
gravis 

- Corynebacterium diphtheriae biotype 
intermedius 

Corynebacterium diphtheriae biotype 

120 mitis 

Corynebacterium fiavescens 
Corynebacterium glucuronolyticum 
Corynebacterium glucuronotyticum- 
seminale 

1 25 Corynebacterium group A 
Corynebacterium group A-4 
Corynebacterium group A-5 
Corynebacterium group ANF 
Corynebacterium group B 

1 30 Corynebacterium group B-3 
Corynebacterium group F 
Corynebacterium group F-1 
Corynebacterium group F-2 
Corynebacterium group G 

135 Corynebacterium group G-1 
Corynebacterium group G-2 
Corynebacterium group I 
Corynebacterium group 1-2 
Corynebacterium jeikeium (group JK) 

1 40 Corynebacterium kutscheri ( C. 
murium) 

Corynebacterium macginleyi 
Corynebacterium minutissimum 
Corynebacterium pilosum 

145 Corynebacterium propinquum 
Corynebacterium 
pseudodiphtheriticum 
Corynebacterium pseudotuberculosis 
Corynebacterium pyogenes 

150 Corynebacterium renale 

Corynebacterium renale group 
Corynebacterium seminale 
Corynebacterium species 
Corynebacterium striatum {C. 

155 ftavidum) 

Corynebacterium uicerans 



Corynebacterium urealyticum (group 
D2) 

Corynebacterium xerosis 
1 60 Cryptococcus albidus 
Cryptococcus ater 
Cryptococcus cereanus 
Cryptococcus gastricus 
Cryptococcus humicolus 
1 65 Cryptococcus lactaUvorus 
Cryptococcus laurentu 
Cryptococcus tuteolus 
Cryptococcus metibiosum 
Cryptococcus neoformans 
170 Cryptococcus species 
Cryptococcus terreus 
Cryptococcus uniguttulatus 
Debaryomyces hansenil 
Debaryomyces marama 
175 Debaryomyces pofymorphus 
Debaryomyces species 
Dermabacter hominis 
Dermacoccus {Micrococcus) 
nishinomiyaensis 
1 80 Dietzia species 

Edwardsiella hoshinae 
Edwardsiella ictaluri 
Edwardsiella species 
Edwardsiella tarda 
185 Elkenetla corrodens 

Empedobacter brevls {Flavobacterium 
breve) 

Enterobacter aerogenes 

Enterobacter agglomerans 
190 Enterobacter amnigenus 

Enterobacter amnigenus asburiae 

(CDC enteric group 17) 

Enterobacter amnigenus biogroup 1 

Enterobacter amnigenus biogroup 2 
1 95 Enterobacter asburiae 

Enterobacter cancerogenus 

Enterobacter cloacae 

Enterobacter gergoviae 

Enterobacter hormaechei 
200 Enterobacter intennedius 

Enterobacter sakazakii 

Enterobacter species 

Enterobacter taylorae 

Enterobacter taylorae (CDC enteric 
205 group 19) 

Enterococcus {Streptococcus) 

coco rum 

Enterococcus {Streptococcus) faecalis 

(Group D) 
210 Enterococcus {Streptococcus) 

faecium{Group D) 

Enterococcus {Streptococcus) 

saccharotytlcus 

Enterococcus avium (Group D) 
215 Enterococcus casseliftavus 

{Steptococcus faecium subsp. 

casseliftavus) 

Enterococcus durans {Streptococcus 

faecium subsp. durans) (Group D) 
220 Enterococcus gaUinarum 

Enterococcus hirae 

Enterococcus malodoratus 

Enterococcus mundtii 

Enterococcus raffinosus 
225 Enterococcus species 

Erwinia amylovora 

Erwinia carotovora 

Erwinia carotovora subsp. atroseptica 

Erwinia carotovora subsp. 
230 betavasculorum 

Erwinia carotovora subsp. carotovora 

Erwinia chrysanmemi 

Erwinia cypripedil 

Erwinia maflotivora 
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Erwinia nigrifluens 
Erwinia quercina 
Erwinla rhapontici 
Erwinia rvbrifaciens 
5 Erwinla salicis 
Erwinia species 
Erysipelothrix musiopathiae 
Erysipelothrix species 
Escherichia btattae 
10 Escherichia coli 

Escherichia coii A-D 
Escherichia coii 0157:H7 
Escherichia fergusonii 
Escherichia hermannii 
1 5 Escherichia species 
Escherichia vulneris 
Eubacterium aerofaciens 
Eubacterium alactolyticum 
Eubacterium lentum 
20 Eubacterium limosum 
Eubacterium species 
Ewingelta americana 
FHobasidiella neotonnans 
Filobasidium floriforme 
25 Filobasidium uniguttulatum 
Flavimonas oryzihabitans 
Flavobacterium gleurn^ 
riavooacterium indoiogenes 
Fla voba cterium odoratum 
30 Flavobacterium species 
Franciseila novidda 
Franciseila philomiragla 
Francisella species 
Franciseila tularensis 
35 Fusobacterium mortiterum 
Fusobacterium necrogenes 
Fusobacterium necrophorum 
Fusobacterium nucleatum 
Fusobacterium species 
40 Fusobacterium varium 
Gaffkya species 
Gardnereila vaginalis 
Gemella haemotysans 
Gemella morblllorum 
45 Gemella species 

Geotrichum candidum 
Geotrichum fermentans 
Geotrichum peniclllamm 
Geotrichum penicillatum 
50 Geotrichum species 
Gordons species 
Haemophilus aegyptius 
Haemophilus aphrophilus 
Haemophilus ducreyi 
55 Haemophilus haemog/obinophilus 
Haemophilus haemolyticus 
Haemophilus influenzae 
Haemophilus influenzae biotype I 
Haemophilus influenzae biotype II 
60 Haemophilus influenzae biotype 111 
Haemophilus influenzae biotype IV 
Haemophilus influenzae biotype V 
Haemophilus Influenzae biotype VI 
Haemophilus influenzae biotype VII 
65 Haemophilus influenzae biotype VIII 
Haemophilus paragallinarum 
Haemophilus parahemolyticus 
Haemophilus paralnfluenzae 
Haemophilus paralnfluenzae biotype i 
70 Haemophilus paralnfluenzae biotype II 
Haemophilus paralnfluenzae biotype 
III 

Haemophilus paralnfluenzae biotype 

7 5 Haemophilus paralnfluenzae biotype V 
Haemophilus paralnfluenzae biotype 
VI 

Haemophilus parainfluenzae biotype 



VII 

80 Haemophilus parainfluenzae biotype 
VIII 

Haemophilus peraphrohaemotyticus 
Haemophilus paraphrophilus 
Haemophilus segnls 
85 Haemophilus somnus 
Haemophilus species 
Hafnia alvei 

Hanseniaspora guilliermondii 

Hansenlaspora uvarum 
90 Hanseniaspora valbyensis 

Hansenula anomala 

Hansenula holstii 

Hansenula potymorpha 

Helicobacter (Campylobacter) cinaedl 
95 Helicobacter (Campylobacter) 

fennelliae 

Helicobacter (Campylobacter) pylon 

Issatchenkia orientalis 

Kingeila denitrificans 
1 00 Kingeila Indologenes 

Kingeila kingae 

Kingeila species 

Klebsiella omithinotytlca 

Klebsiella oxytoca 
1 05 Klebsiella planticola 

Klebsiella pneumoniae subsp. 

ozaenae 

Klebsiella pneumoniae subsp. 
pneumoniae 
110 Klebsiella pneumoniae subsp. 

rhinoscleromatis 

Klebsiella species 

Klebsiella terrigena 

Ktoeckera apiculata 
115 Kloeckera ap is 

Kloeckera /aponica 

Kloeckera species 
■ -Kluyvera ascorbata 

Kluyvera cryocrescens 
1 20 Kluyvera species 

Kluyveromyces lactis 

Kluyveromyces marxianus 

Kluyveromyces thermotolerans 

Kocuria (Micrococcus) kristinae 
1 25 Kocuria (Micrococcus) rosea 

Kocuria(Micrococcus) varians 

Koserelta trabulsii 

Kytococcus (Micrococcus) sedentarius 

Lactobacillus (Weissalla) viridescens 
130 Lactobacillus A 

Lactobacillus acidophilus 

Lactobacillus B 

Lactobacillus brevis 
, Lactobacillus buchneri 
135 Lactobacillus casei 

Lactobacillus easel subsp. casei 

Lactobacillus casei subsp. lactosus 

Lactobacillus casei subsp. rhamnosus 

Lactobacillus catenaformls 
1 40 Lactobacillus cellobiosus 

Lactobacillus collinoides 

Lactobacillus coprophilus 

Lactobacillus crispatus 

Lactobacillus curvatus 
145 Lactobacillus delbrueckii subsp. 

bulgaricus 

Lactobacillus delbrueckii subsp. 
delbrueckii 

Lactobacillus delbrueckii subsp. lactis 
150 Lactobacillus fermentum 

Lactobacillus fructivorans 

Lactobacillus helveticus 

Lactobacillus helveticus subsp. jugurti 

Lactobacillus jensenii 
155 Lactobacillus lindneri 

Lactobacillus minutus 



Lactobacillus paracasei subsp. 
paracasei 

Lactobacillus pentosus 
1 60 Lactobacillus plantarum 
Lactobacillus salivarius 
Lactobacillus salivarius var. salicinius 
Lactobacillus species 
Lactococcus diacitilactis 
165 Lactococcus garvieae 

Lactococcus lactis subsp. cremoris 
Lactococcus lactis subsp. diacitilactis 
Lactococcus lactis subsp. hordniae 
Lactococcus lactis subsp. lactis 
170 Lactococcus plantarum 
Lactococcus raffinolactis 
Leclercia adecarboxylata 
Legionella species 
Leminorella species 
175 Leptospira species 
Leptotrichia buccalis 
Leuconostoc (Weissella) 
paramesenteroides 
Leuconostoc camosum 
1 80 Leuconostoc citreum 
Leuconostoc gelidum 
Leuconostoc lactis 
lBuconostocmecer.iercidas 
Leuconostoc mesenteroides subsp. 
185 cremoris 

Leuconostoc mesenteroides subsp. 
dextranicum 

Leuconostoc mesenteroides subsp. 
mesenteroides 
190 Leuconostoc species 
Listeria grayi 
Listeria innocua 
Listeria ivanovii 
Listeria monocytogenes 
195 Listeria murrayl 

Listeria seeligeri - 
Listeria species 
Listeria welshimeri 
Megasphaera elsdenii 
200 Methylobacterium mesophilicum 
Metschnikowia pulcherrima 
Microbacterium species 
Micrococcus luteus 
Micrococcus fylae 
205 Micrococcus species 
Mobiluncus curtisll 
Mobiluncus mufieris 
Mobiluncus species 
Moelterella wisconsensis 
210 Moraxella (Branhamella) catarrhalls 
Moraxella atlantae 
Moraxella bovis 
Moraxella lacunata 
Moraxella nonliquefaciens 
215 Moraxella osloensis 

Moraxella phenylpyruvica 
Moraxella species 
Morganella morganii 
Morganella morganii subsp. morganii 
220 Morganella morganii subsp. slbonii 
Mycobacterium africanum 
Mycobacterium asiaticum 
Mycobacterium avium 
Mycobacterium bovis 
225 Mycobacterium chelonae 
Mycobacterium fortuitum 
Mycobacterium gordonae 
Mycobacterium kansasii 
Mycobacterium malmoense 
230 Mycobacterium marinum 
Mycobacterium phiel 
Mycobacterium scrofulaceum 
Mycobacterium smegmatis 
Mycobacterium species 
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Mycobacterium tuberculosis 
Mycobacterium ulcerans 
Mycobacterium xenopi 
Mycoplasma fermentans 
5 Mycoplasma hominis 
Mycoplasma orale 
Mycoplasma pneumoniae 
Mycoplasma species 
Myrotdes species 
10 Neisseria cinerea 

Neisseria elongata subsp. elongate 
Neisseria flava 
Neisseria flavescens 
Neisseria gonorrhoeae 
1 5 Neisseria lactamica 
Neisseria meningitidis 
Neisseria mucosa 
Neisseria perHava 
Neisseria polysaccharea 
20 Neisseria saprophytes 
Neisseria sicca 
Neisseria subflava 
Neisseria weaveri 
Neisseria weaveri (CDC group M5) 
25 Nocardia species 

Ochrobactrum anthropi 
Oerskovia species 
Oerskovia xanthlneotytica 
Oligefla (Moraxella) urethrals 
30 Oligefla species 
Oligella ureotytica 
Paenibaciilus alvei 
Paenibacilius macerans 
Paenibaciilus potymyxa 
35 Pantoea agglomerans 

Pantoea ananas (Erwinla uredovora) 
Pantoea disperse 
Pantoea species 
Pantoea stewartii 
40 PasteureUa (Haemophilus) avium 
Pasteurella aerogenes 
PasteureUa gallinamm 
Pasteurella haemolytica 
Pasteurella haemotytlcus 
45 Pasteurella muttocida 

Pasteurella multocida SF 
Pasteurella multocida subsp. 
multocida 

Pasteurella multocida subsp. septica 
50 Pasteurella pneumotropica 
Pasteurella species 
Pasteurella ureae 
Pediococcus acidilactici 
Pedlococcus damnosus 

5 5 Pediococcus pentosaceus 
Pediococcus species 
Peptococcus niger 
Peptococcus species 
Peptostreptococcus anaerobius 

60 Peptostreptococcus asaccharolyticus 
Peptostreptococcus indollcus 
Peptostreptococcus magnus 
Peptostreptococcus micros 
Peptostreptococcus parvulus 

65 Peptostreptococcus prevotii 
Peptostreptococcus productus 
Peptostreptococcus species 
Peptostreptococcus tetradius 
Phaecoccomyces exophlaliae 

70 Photobacterium damsetae 
Pichfa (Hansenula) anomala 
Pichia {Hansenula) jadlnii 
Pichia (Hansenula) petersonii 
Pichia angusta (Hansenula 

75 polymorpha) 

Pichia carsonii (P. vinl) 
Pichia etchellsil 
Pichia farinosa 



Pichia femnentans 
80 Pichia membranaefaciens 
Pichia norvegensis 
Pichia ohmeri 
Pichia spartinae 
Pichia species 
85 Plesiomonas shigelloides 

Porphyromonas asaccharofytica 
Porphyromonas endodontalis 
Porphyromonas gingivalis 
Porphyromonas levii 
90 Prevoteila (Bacteroides) buccae 
Prevotella (Bacteroides) buccaJis 
Prevoteila (Bacteroides) corporis 
Prevotella (Bacteroides) dentlcola 
Prevoteila (Bacteroides) loescheii 
95 Prevotella (Bacteroides) oralis 
Prevotella (Bacteroides) disiens 
Prevotella (Bacteroides)oris 
Prevotella bivia (Bacteroides btvius) 
Prevotella intermedia (Bacteroides 
100 intermedlus) 

Prevotella melaninogenica 
(Bacteroides melaninogenicus) 
Prevotella rumlnlcola 
Propionibacterium acnes 
105 Propionibacterium avidum 

Propionibacterium granulosum 
Propionibacterium propionicum 
Propionibacterium species 
Proteus mirabilis 
110 Proteus penneri 
Proteus species 
Proteus vulgaris 
Prototheca species 
Prototheca wickerhamii 
115 Prototheca zopfii 

Providencla alcalifaciens 
Providencia heimbachae 
■ Providencia rettgeri 
Providencla rustlgianii 
120 Providencia species 
Providencia stuartii 
Providencla stuariii urea + 
Pseudomonas (Chryseomonas) 
luteola 

1 25 Pseudomonas acidovorans 

Pseudomonas aeruginosa 

Pseudomonas alcaligenes 

Pseudomonas cepacia 

Pseudomonas chlororaphis (P. 
130 aureofaciens) 

Pseudomonas fiuorescens 

Pseudomonas fiuorescens group 

Pseudomonas mendoclna 

Pseudomonas pseudoalcartgenes 
135 Pseudomonas putida 

Pseudomonas species 

Pseudomonas stutzeri 

Pseudomonas testosteroni 

Pseudomonas vesicularis 
140 Pseudoramibacter {Eubacterium) 

alactotyticus 

Psychrobacter (Moraxella) 

phenyfpynjvicus 

Rahnelta aquatiiis 
145 Ralstonia (Pseudomonas. 

Burkholderia) pickettii 

Rhodococcus (Corynebacterium) equi 

Rhodococcus species 

Rhodosporidlum toruloides 
150 Rhodotorula gfutlnis 

Rhodotorula minuta 

Rhodotorula mucilaginosa (R. rubra) 

Rhodotorula species 

Rickettsia species 
1 55 Rothla dentocariosa 

Saccharomyces cerevisiae 



Saccharomyces exlguus 
Saccharomyces kluyverii 
Saccharomyces species 
160 Sakaguchia dacryoides 

(Rhodosporidium dacryoldum) 
Salmonella arizonae 
Salmonella cholera esuis 
Salmonella enteritidis 
165 Salmonella gallinamm 
Salmonella paratyphi A 
Salmonella paratyphi B 
Salmonella pullorum 
Salmonella species 
170 Salmonella typhi 

Salmonella typhimurium 
Salmonella typhisuis 
Salmonella/Arizona 
Serratia ficaria 
175 Serratia fonticola 
Serratia grimesil 
Serratia tiquefacians 
Serratia marcescens 
Serratia odorifera 
1 80 Serratia odorifera type 1 
Serratia odorifera type 2 
Serratia ptymuthica 
Serratia proteamaculans 
Serratia proteamaculans subsp. 
185 proteamaculans 

Serratia proteamaculans subsp. 
quinovora 
Serratia rubidaea 
Serratia species 
1 90 Shewanella (Pseudomonas, 
Alteromonas) putrefaciens 
Shigella boydli 
Shigella dysenteriae 
Shigella flexneri 
195 Shigella sonnei 
Shigel]a species 
Sphingobacterium multlvorum 
Sphingobacterium species 
Sphingobacterium spiriVrvorum 
200 Sphingobacterium thalpophilum 
Sphlngomonas (Pseudomonas) 
paucfmobilis 

Sporidiobolus salmonlcolor 

Sporobolomyces roseus 
205 Sporobolomyces salmonicotor 

Sporobolomyces species 

Staphylococcus (Peptococcus) 

saccharolytlcus 

Staphylococcus artertae 
210 Staphylococcus aureus 

Staphylococcus aureus (Coaguiase- 

negatrve) 

Staphylococcus auricularis 
Staphylococcus capitis 
215 Staphylococcus capitis subsp. capitis 
Staphylococcus capitis subsp. 
ureolyticus 

Staphylococcus caprae 
Staphylococcus camosus 

220 Staphylococcus caseolyticus 
Staphylococcus chromogenes 
Staphylococcus cohnii 
Staphylococcus cohnii subsp. cohnii 
Staphylococcus cohnii subsp. 

225 urealyticum 

Staphylococcus epidermidis 
Staphylococcus equorum 
Staphylococcus gallinarum 
Staphylococcus haemotytlcus 

230 Staphylococcus hominis 

Staphylococcus hominis subsp. 
hominis 

Staphylococcus hominis subsp. 
novobiosepticus 
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Table 15. Micr rganisms identified by aut mated systems (continued). 



Staphylococcus hyicus 
Staphylococcus intermedius 
Staphylococcus kloosii 
5 Staphylococcus fentus 

Staphylococcus lugdunensis 
Staphylococcus saprophyticus 
Staphylococcus schlelferi 
Staphylococcus sciuri 

1 0 Staphylococcus simulans 
Staphylococcus species 
Staphylococcus wameri 
Staphylococcus xylosus 
Stenotrophomonas (Xanthomonas) 

1 S mattophilia 

Stephanoascus dferrii 
Stomatococcus mucilaginosus 
Streptococcus adcfominimus 
Streptococcus agalactiae 

20 Streptococcus agalactiae (Group B) 
Streptococcus agalactiae hemolytic 
Streptococcus agalactiae non- 
hemolytic 

Streptococcus alactolyticus 
25 Streptococcus anginosus 

Streptococcus anginosus (Group D, 

nonenterococci) 

Stegfocgceusjie^ 

Streptococcus beta-hemolytic non- 
30 group A or B 

Streptococcus beta-hemolytic non- 
group A 

Streptococcus beta-hemolytic 

Streptococcus bovis (Group D, 
35 nonenterococci) 

Streptococcus bovis I 

Streptococcus bovis II 

Streptococcus cants 
^Streptococcus constellatus 
40 Streptococcus constellatus 

(Streptococcus milleri I) 

Streptococcus constellatus (vlridans 

Streptococcus) 

Streptococcus downei 
45 Streptococcus dysgatactiae subsp. 

dysgatactiae 

Streptococcus dysgatactiae subsp. 
equlslmllls 

Streptococcus equi (Group C/Group G 
50 Streptococcus) 

Streptococcus equi subsp. equi 

Streptococcus equi subsp. 

zooepldemicus 

Streptococcus equinus 
55 Streptococcus equinus (Group D, 

nonenterococci) 

Streptococcus equisimifis 

Streptococcus equlsfmulis (Group 

C/Group G Streptococcus) 



60 Streptococcus Gamma (non)- 
hemolytic 

Streptococcus gordonii 

Streptococcus Group B 

Streptococcus Group C 
65 Streptococcus Group D 

Streptococcus Group E 

Streptococcus Group F 

Streptococcus Group G 

Streptococcus Group L 
70 Streptococcus Group P 

Streptococcus Group U 

Streptococcus intermedius 

Streptococcus intermedius 

(Streptococcus milleri li) 
7 5 Streptococcus intermedius (viridans 

Streptococcus) 

Streptococcus milleri group 

Streptococcus mitis 

Streptococcus mitis (vlridans 
80 Streptococcus) 

Streptococcus mitis group 

Streptococcus mutans 

Streptococcus mutans (viridans 

Streptococcus) 
85 Streptococcus oralis 

Streptococcus parasanguls 

Streptococcus ) pneumoniae 

Streptococcus pordnus 

Streptococcus pyogenes 
90 Streptococcus pyogenes (Group A) 

Streptococcus salivarius 

Streptococcus salivarius (vlridans 

Streptococcus) 

Streptococcus salivarius subsp. 
95 salivarius 

Streptococcus salivarius subsp. 

thermophiius 

Streptococcus sanguis 

Streptococcus sanguis I (viridans 
1 00 Streptococcus) 

Streptococcus sanguis II 

Streptococcus sanguis II (viridans 

Streptococcus) 

Streptococcus sobrinus 
1 05 Streptococcus species 

Streptococcus suis I 

Streptococcus suis II 

Streptococcus uteris 

Streptococcus uteris (viridans 
110 Streptococcus) 

Streptococcus vestibularis 

Streptococcus zooepldemicus 

Streptococcus zooepldemicus (Group 

C) 

115 Streptomyces somaliensis 
Streptomyces species 
Suttonella (King el la) fndologenes 
Tatumella ptyseos 



Tetragenococcus (Pediococcus) 
120 halophilus 

Torulaspora delbrueckii 

(Saccharomyces rosei) 

Torulopsis Candida 

Torulopsis haemuionii 
125 Torulopsis inconspicua 

Treponema species 

Trichosporon asahii 

Trichosporon asteroides 

Trichosporon beigelii 
130 Trichosporon cutaneum 

Trichosporon inkin 

Trichosporon mucoldes 

Trichosporon ovoides 

Trichosporon pullulans 
135 Trichosporon species 

Turicella othidis 

Ureaptasma species 

Ureaplasma urealyticum 

Velllonefia parvula (V. afcalescens) 
140 Veilloneiia species 

Vibrio aiginolyticus 

Vibrio cholerae 

Vibrio damsela 

Vibrio fluvialis 
145 Vibrio fumissif 

Vibrio harveyi 

Vibrio hollisae 

Vibrio metschnlkovii 

Vibrio mimic us 
1 50 Vibrio parahaemolyticus 

Vibrio species 

Vibrio species SF 

Vibrio vulnificus 

Weeksella (Bergeylla) virosa 
155 Weeksella species 

Weeksella virosa 

Williopsis {Hansenufa) satumus 

Xanthomonas campestris 

Xanthomonas species 
1 60 Yarrowia (Candida) lipolytica 

Yersinia aidovae 

Yersinia enterocofitica 

Yersinia enterocolitica group 

Yersinia frederiksenii 
165 Yersinia intermedia 

Yersinia intermedius 

Yersinia kristensenii 

Yersinia pestis 

Yersinia pseudotuberculosis 
1 70 Yersinia pseudotuberculosis SF 

Yersinia ruckeri 

Yersinia species 

Yokenella regensburgei 

Yokenella regensburgei (Koserella 
175 trabulsii) 

Zygoascus hetlenicus 

Zygosaccharomyces species 
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Annex I: Specific and ubiquitous primers for nucleic acid 
amplification ( tuf sequences). 

B «« = ™„™=— — — - — Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10 Bacterial species: Chlamydia pneumoniae 





630 




5 ' -CGG 


AGC 


TAT CCT 


AGT CGT 


TTC A 


20 


2-23 




629 a 




5 ' -AAG 


TTC 


CAT CTC 


AAC AAG 


GTC AAT A 


20 


146-170 


1 J 


Bacterial 


species : 




Chlamydia trachomatis 








554 




5 ' -GTT 


CCT 


TAC ATC 


GTT GTT 


TTT CTC 


22 


82-105 




555 a 




5 ' -TCT 


CGA 


ACT TTC 


TCT ATG 


TAT GCA 


22 


249-272 


20 


Parasitical 


species 




Cryptosporidium parvum 








79-5 




5 ' -TGG 


1 lu 


TCC CAG 


CCG ATC 


GTT T 


ft fiS 


ijo-i f y 




804 a 




5 ' -CCT 


GGG 


ACG GCC 


TCT GGC 


AT 


865 


664-683 


25 


799 




5 ' -ACC 


TGT 


GAA TAC 


AAG CAA 


TCT 


865 


280-300 




805 a 




5 ' -CTC 


TTG 


TCC ATC 


TTA GCA 


GT 


865 


895-914 




800 




5 ' -GAT 


GAA 


ATC TTC 


AAC GAA 


GTT GAT 


865 


307-330 




806 a 




5 ' -AGC 


ATC 


ACC AGA 


CTT GAT 


AAG 


865 


946-966 


30 






















801 




5 ' -AC A 


ALA 


CCG AGA 


AGA TCC 


CA 


pec 
ODD 






« B 0 3 a --^ - 




5 ' -ACT 


TCA 


GTG-GTA 


AC A- CCA 


GC - 


~ 865 - - 


— 616-635 




802 




5 ' -TTG 


CCA 


TTT CTG 


GTT TCG 


TT 


O £ C 
ODD 


"3*77 "3 Q C 


35 


807 a 




5 '-AAA 


GTG 


GCT TCA 


AAG GTT 


GC 


865 


981-1000 




Bacterial 


species : 




Neisseria gonorrhoeae 








551 




5' -GAA 


GAA 


AAA ATC 


TTC GAA 


CTG GCT A 


126 


256-280 


40 


552 a 




5 ' -TAC 


ACG 


GCC GGT 


GAC TAC 


G 


126 


378-396 




p^cterial 


species : 




Streptococcus agalactiae 








549 




5 ' -GAA 


CGT 


GAT ACT 


GAC AAA 


CCT TTA 


207-210 b 


308-331 c 


45 


550 a 




5 ' -GAA 


GAA 


GAA CAC 


CAA CGT 


TG 


207-210 b 


520-539 c 




Bacterial 


soecies : 




Streptococcus pyogenes 








999 




5 ' -TTG 


ACC 


TTG TTG 


ATG ACG 


AAG AG 


1002 


143-165 


50 


1000 a 




5 ' -TTA 


GTG 


TGT GGG 


TTG ATT 


GAA CT 


1002 


622-644 




1001 




5 ' -AAG 


AGT 


TGC TTG 


AAT TAG 


TTG AG 


1002 


161-183 




1000 a 




5 ' -TTA 


GTG 


TGT GGG 


TTG ATT 


GAA CT 


1002 


622-644 



55 



a These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

b These sequences were aligned to derive the corresponding primer. 
c The nucleotide positions refer to the S. agalactiae tuf sequence fragment 
60 (SEQ ID NO. 209) . 
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Annex I: Specific and ubiquitous primers for nucleic acid 
amplification {tuf sequences) (continued). 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



10 



45 



50 



Parasitical species : Trypanosoma. Jbrucei 

82 0 5' -GAA GGA GGT GTC TGC TTA CAC 



864 



513-533 





821 a 


5 ' 


-GGC 


GCA 


AAC GTC ACC ACA 


TCA 






864 


789 


-809 


15 


820 


5' 


-GAA 


GGA 


GGT GTC TGC TTA 


CAC 






864 


513 


-533 




822 a 


5' 


-CGG 


CGG 


ATG TCC TTA ACA 


GAA 






864 


909 


-929 




Parasitical 


spegies 




Trypanosoma. 


cruzi 










20 


794 


5' 


-GAC 


GAC 


AAG TCG GTG AAC 


TT 






840-842 b 


281- 


300 c 




795 a 


5' 


-ACT 


TGC 


ACG CGA TGT GGC 


AG 






840-842 b 


874- 


893 c 




Bacterial oenus 






Bordatolla sp. 












25 


825 


5' 


-ATG 


AGC 


ARC GSA ACC ATC 


GTT 


CAG 




OCT 
OuJ 


1 - 


Zb 




826 


5' 


-TCG 


ATC 


GTG CCG ACC ATG 


TAG 


AAC 


GC 


863 


1342 


-1367 




Funeral qenus 








Candida sp. 














30 


576 


5' 


-AAC 


TTC 


RTC AAG AAG GTY 


GGT 


TAC 


AA 


407-426, 


332- 


357 d 




















428-432 b 








632 a 


5' 


-CCC 


TTT 


GGT GGR TCS TKC 


TTG 


GA 




407-426, 


791- 


813 d 




















428-432 b 






35 


631 


5' 


-CAG 


ACC 


AAC YGA IAA RCC 


ATT 


RAG 


AT 


407-426, 


523- 


548 d 




















428-432 b 








632 a 


5' 


-CCC 


TTT 


GGT GGR TCS TKC 


TTG 


GA 




407-426, 


791- 


813 d 




















428-432 b 






40 


633 


5' 


-CAG 


ACC 


AAC YGA IAA RCC 


ITT 


RAG 


AT 


407-426, 


523- 


548 d 




















428-432 b 








632 a 


5' 


-CCC 


TTT 


GGT GGR TCS TKC 


TTG 


GA 




407-426, 


791- 


813 d 




















428-432 b 







a These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
b These sequences were aligned to derive the corresponding primer. 
° NO 6 842) le ° tide positions refer to the cruzi tuf sequence fragment (SEQ ID 

d The nucleotide positions refer to the C. albicans tuf(EF-l) sequence fragment 
(SEQ ID NO. 408) . y 
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Annex X: Specific and ubiquitous prim rs for nucleic acid 
amplification ( tuf s quenc s) (continued). 



Originating DNA fragment 



SEQ ID NO . 


Nucleotide 


sequence 








£>£*U ID 
NO . 


Nucleotide 
pos i t ion 


Bacterial 


aenus : 




CI os fcx\i dl tun 


S"D • 










796 


5 ' -GGT 


CCA 


ATG CCW CAA ACW 


AGA 






32 , 719- 


32-52 b 
















724,736 a 




797 c 




TAA 


GAA TGG YTT ATC 


TGT 


SKC 


TCT 


32,719- 


320-346 b 
















724,736 a 




808 


5'-GCI 


TTA 


IWR GCA TTA GAA 


RAY 


CCA 




32,719- 


224-247 b 
















724, 736 a 




809 c 


5 ' -TCT 


TCC 


TGT WGC AAC TGT 


TCC 


TCT 




32 , 719- 


337-360 b 
















724,736 a 




810 


5 ' -AG A 




ACA GAT AAR SCA 


TTC 


TTA 




32, 719- 


320-343 b 
















724,736 a 




811 c 


5'-TRA 


ART 


AGA ATT GTG GTC 


TRT 


ATC 


C 


32,719- 


686-710 b 
















724 , 736** 




Bacterial 


genus : 




CoryneJbacterium sp. 








545 


5' -TAG 


ATC 


CTB GTY GCI CTI 


AAC 


AAG 


mp 




go i i a d 


1 546 c 


5 ' -CCR 


CGI 


CCG GTR ATG GTG 


AAG 


AT 




34-44, 662 a 


350-372 d 


Parasitica,! genus: 




Entamoeba sp 












703 


5 ' -TAT 


GGA 


AAT TCG AAA CAT 


CT 






512 


38-57 


704 c 


5 ' -AGT 


GCT 


CCA ATT AAT GTT 


GG 






512 


442-461 


703 


5 ' -TAT 


GGA 


AAT TCG AAA CAT 


CT 






512 


38-57 


705 c 


5 ' -GTA 


CAG 


TTC CAA TAC CTG 


AA 






512 


534-553 


703 


5 ' -TAT 


GGA 


AAT TCG AAA CAT 


CT 






512 


38-57 


706 c 


5 ' -TGA 


AAT 


CTT CAC ATC CAA 


CA 






512 


768-787 


793 


5 ' -TTA 


TTG 


TTG CTG CTG GTA 


CT 






512 


149-168 


704 c 


5 ' -AGT 


GCT 


CCA ATT AAT GTT 


GG 






512 


442-461 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the C. perfringens tuf sequence fragment 
(SEQ ID NO. 32) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the C. dlphtheriae tuf sequence fragment 
(SEQ ID NO. 662) ... 
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Annex I: Specific and ubi<ju items primers for nucleic acid 
amplification (tuf sequences) (continu d) • 

Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10 


Bacterial 


aenus : 




Enterococcus sp. 








656 


5 ' - AAT 


TAA 


TGG CTG CAG TTG AYG A 


58-72 a 


273-294 b 




657 c 


5 ' -TTG 


TCC 


ACG TTC GAT RTP TTP A 


RQ_7*>a 

JO 1 Ct 


JjO-3 / / 


15 


656 


5' -AAT 


TAA 


TGG CTG CAG TTG AYG A 


58-72 a 


273-294 b 




271 c 


5' -TTG 


TCC 


ACG TTG GAT RTC TTC A 


58-72 a 


556-577 b 




1137 


5 ' -AAT 


TAA 


TGG CTG CWG TTG AYG AA 


58-72 a 


273-295 b 


20 


1136° 


5 ' -ACT 


TGT 


CCA CGT TSG ATR TCT 


58-72 a 


559-579 b 




Parasitical oenus 




Gia.rd.ia. sp . 








816 


5 ' -GCT 


ACG 


ACG AGA TCA AGG GC 


513 


305-324 


25 


819 c 


5 ' -TCG 


ALjl_ 


1 lv_ lOU AWa AAG AG 


C 1 o 


895-914 




817 


5 ' -TGG 


AAG 


AAG GCC GAG GAG TT 


513 


355-374 




818 c 


5 ' -AGC 


CGG 


GCT GGA TCT TCT TC 


513 


825-844 


jyj 


Parasitical aenus : 




Leishmanial sp. 


















701 


5 ' -GTG 


TTC 


ACG ATC ATC GAT GCG 


514-526 a 


94-114 d 




702 c 


5'-CTC 


TCG 


ATA TCC GCG AAG CG 


514-526 a 


913-932 d 


35 


Bacterial 


genu? : 




Staphylococcus sp. 






553 


5 ' -GGC 


CGT 


GTT GAA CGT GGT CAA ATC A 


176-203 a 


313-337 e 




575 c 


5 ' -TIA 


CCA 


TTT CAG TAC CTT CTG GTA A 


176-203 3 


653-677 e 


40 


553 


5' -GGC 


CGT 


GTT GAA CGT GGT CAA ATC A 


176-203 9 


313-337e 


707 c 


5 ' -TWA 


CCA 


TTT CAG TAC CTT CTG GTA A 


176-203 a 


653-677 e 




Bacterial 


aenus : 




Streptococcus sp. 








547 


5 ' -GTA 


CAG 


TTG CTT CAG GAC GTA TC 


206-231 a 


372-394 f 


45 


548 c 


5 ' -ACG 


TTC 


GAT TTC ATC ACG TTG 


206-231 a 


548-568 f 



50 



55 



a These sequences were aligned to derive the corresponding primer, 
b The nucleotide positions refer to the E. durans tuf sequence fragment (SEQ ID 
NO. 61) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the L. tropica tuf(EF-l) sequence fragment 
(SEQ ID NO. 526) . 

e The nucleotide positions refer to the S. aureus tuf sequence fragment (SEQ ID 
NO. 179) . 

f The nucleotide positions refer to the S. agalactiae tuf sequence fragment 
(SEQ ID NO. 209) . 



127 



CA 02307010 2000-05-19 



5 



30 



Annex I: Specific and ubiquitous primers for nucleic acid 
amplif ication ( tuf" sequenc s) (continued). 

~ * Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10 Parasitical genus : Trypanosoma sp. 

823 5 ' -GAG CGG TAT GAY GAG ATT GT 529,840- 493-512 b 

842, 864 a 

824 c 5'-GGC TTC TGC GGC ACC ATG CG 529,840- 1171-1190 b 

15 842,864 a 

Bacterial family : AOrcoJbactejriaceae 

539 5'-CCI TAC ATC CTB GTY GCI CTI AAC AAG 122 85-111 

20 540 c 5 ' -GGD GCI TCY TCR TCG WAI TCC TG 122 18T-203 

Bacterial group : JEnterol>actex*iaceae group 

933 5'-CAT CAT CGT ITT CMT GAA CAA RTG 78, 103,146, 390-413 d 

25 168,238,698 a 

934 c 5'-TCA CGY TTR RTA CCA CGC AG I AGA 78,103,146, 831-854 d 

168,238, 698 a 



Parasiti cal family: Tryp&nosom&tldae family 



923 5'-GAC GCI GCC ATC CTG ATG ATC 511,514-526, 166-188 e 

529,840-842, 

864 a 

924 c 5 ' -ACC TCA GTC GTC ACG TTG GCG 511,514-526, 648-668 e 

35 529,840-842, 

864 a 

925 5' -AAG CAG ATG GTT GTG TGC TG 511,514-526, 274-293 e 

529, 840-842, 

40 864 a 

926 c 5' -CAG CTG CTC GTG GTG CAT CTC GAT 511,514-526, 676-699 e 

529, 840-842, 

864 a 

45 

a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the T, Jbrucei tuf (EF-1) sequence fragment 
(SEQ ID NO. 864) . 

c These sequences are from the complementary DNA strand of the sequence of the 
50 originating fragment given in the Sequence Listing. 

d The nucleotide positions refer to the E. coll tuf sequence fragment (SEQ ID 
NO. 698) . 

e The nucleotide positions refer to the L. tropica tuf sequence fragment (SEQ 
ID NO. 526) . 
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Annex Is Specific and ubiquitous primers for nucleic acid 
amplification ( tuf sequences) (continued). 

" Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Parasitical family : Trypanosoma tidae family (continued) 

927 5'-ACG CGG AGA AGG TGC GCT T 511,514-526, 389-407 b 

529,840-842, 
864 a 

928 c 5'-GGT CGT TCT TCG AGT CAC CGC A 511,514-526, 778-799 b 



529, 840-842, 
864 a 



Bacterial grout) : Pseudomonads group 



541 5'-GTK GAA ATG TTC CGC AAG CTG CT 153-155 a 476-498 d 

542 c 5 ' -CGG AAR TAG AAC TGS GGA CGG TAG 153-155 a 679-702 d 

541 5'-GTK GAA ATG TTC CGC AAG CTG CT 153-155 a 476-498 d 

544c 5'-AYG TTG TCG CCM GGC ATT MCC AT 153-155 a 749-771 d 

Universal primers 

636 5 '-ACT GGY GTT GAI ATG TTC CGY AA 7,54,78, 470-492 e 

100, 103, 159, 

209, 224, 227 b 

637 a 5'-ACG TCA GTI GTA CGG AAR TAG AA 7,54,78, 692-714 e 

100,103,159, 
209,224,227° 

638 5' -CCA ATG CCA CAA ACI CGT GAR CAC AT 7,54,78, 35-60 f 

100,103,159, 

209, 224, 227 b 

639a 5'-TTT ACG GAA CAT TTC WAC ACC WGT IAC A 7,54,78, 469-496 f 

100,103,159, 
209,224,227 b 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the L . tropica tuf (EF-1) sequence fragment 
(SEQ ID NO. 526) . 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d The nucleotide positions refer to the P. aeruginosa tuf sequence fragment 

(SEQ ID NO. 153) . 

e The nucleotide positions refer to the E. coli tuf sequence fragment (SEQ ID 
NO. 78) . 

f The nucleotide positions refer to the B . cereus tuf sequence fragment (SEQ ID 
NO. 7) . 
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Annex X: 



Specific and ubiquitous primers for 
amplif ication (tuf sequences) (continued), 



nucl ic acid 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 

NO. 



Nucleotide 
position 



10 



15 



20 



25 



30 



643 



5 ' -ACT GGI GTI GAR ATG TTC CGY AA 



644* 



5 ' —AGS TCI ST I GTI CKG AAR T'AG AA 



1,3, 4,7, 12, 
13, 16,49, 54, 
72,78,85, 88, 
91,94,98,103, 
108, 112, 115, 
116,120,121, 
126, 128, 134, 
136,146,154, 
159,179,186, 
205,209,212, 

224, 238 a 

1,3,4, 7 , 13 , 
13, 16,49,54, 
72,78,85,88, 
91,94,98,103, 
108,112,115, 
116, 120,121, 
126, 128,134, 
136,146,154, 
159,179,186, 
205,209,212, 

224,238 a 



470-492 b 



6'92-~/T4"~ 



a These sequences were aligned to derive the corresponding primer. 
35 b The nucleotide positions refer to the E. coli tuf sequence fragment (SEQ ID 
NO. 78) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex I: Specific and ubiquitous primers for nucleic acid 
amplif ication {tuf s qu nces) (continued). 

— — Originating DNA fragment 

J SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

10 643 5 ' -ACT GGI GTI GAR ATG TTC CGY AA 1,3,4,7,12, 470-492 b 

13,16,49,54, 
72,78,85,88, 
91,94,98,103, 
108,112,115, 

15 116,120,121, 

126,128,134, 
136,146,154, 
159,179,186, 
205,209,212, 

20 224,238 a 

645 c 5'-ACG TCI GTI GTI CKG AAR TAR AA 1,3,4,7,12, 692-714 b 

13,16,49,54, 
72,78, 85, 88, 
91,94, 98,103, 

25 108,112,115, 

116, 120, 121, 
126,128,134, 
136,146,154, 
159,179,186, * 

30 205,209,212, 

224,238 a 

646 5'-ATC GAC AAG CCI TTC YTI ATG SC 2,13,82 317-339 d 

122,145 a 

35 647 c 5'-ACG TCC GTS GTR CGG AAG TAG AAC TG 2,13,82 686-711 d 

122,145 a 

646 5'-ATC GAC AAG CCI TTC YTI ATG SC 2,13,82 317-339 d 

122,145 a 

40 648 c 5'-ACG TCS GTS GTR CGG AAG TAG AAC TG 2,13,82 686-711 d 

122,145 a 



45 a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the E. coli tuf sequence fragment (SEQ ID 
NO. 78) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
50 d The nucleotide positions refer to the A. meyeri tuf sequence fragment (SEQ ID 
NO. 2) 
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Annex I: Specific and ubiquitous primers for nucleic acid 
amplif ication ( tuf sequences) (continued). 

~ ~ — Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Universal primers (continued) 



649 
650 c 


5' 
5' 


-GTC 
-TTA 


CTA 
CGG 


TGC 
AAC 


CTC 
ATY 


ARA 
TCA 


CWC 
ACA 


GIG 
CCI 


AGC 
GT 


AC 


8, 86, 141, 143 a 
8, 86, 141, 143 a 


33-58 b 
473-495 b 


636 

cciC 
D jl 


5' 
5' 


-ACT 
-TGA 


GGY 
CGA 


GTT 
CCA 


GAI 

CCI 


ATG 
TCY 


TTC 
TCY 


CGY 
TTY 


AA 
TTC 


A 


8, 86, 141, 143 a 
8, 86, 141,143 s 


473-495 b 
639-663 b 




Sequencing primers 
















556 
557 c 


5 1 
5' 


-CGG 
-CCM 


CGC 
AGG 


NAT 
CAT 


CYT 
RAC 


SGT 
CAT 


CTC 


man 

GGT 


G 






SS8 d 
668 d 


3GS-326 
1047-1068 


694 
557c 


5' 
5' 


-CGG 
-CCM 


CGC 
AGG 


IAT 
CAT 


CYT 
RAC 


SGT 
CAT 


TGT 
CTC 


TGC 
GGT 


G 






668 d 
668 d 


306-326 
1047-1068 


664 
652 c 


5' 
5' 


-AAY 
-CCW 


ATG 
AYA 


ATI 
GTI 


ACI 
YKI 


GGI 
CCI 


GCI 
CCY 


GCI 
TCY 


CAR 
CTI 


ATG 
ATA 


GA 


619 d 
619 d 


604-632 
1482-1508 


664 
561 c 


5' 
5' 


-AAY 
-AC I 


ATG 
GTI 


ATI 
CGG 


ACI 
CCR 


GGI 
CCC 


GCI 
TCA 


GCI 
CGG 


CAR 
AT 


ATG 


GA 


619 d 
619 d 


604-632 
1483-1505 


543 
660° 


5 • 
5 ' 


-ATC 
-GTA 


TTA 
GAA 


GTA 
TTG 


GTT 
AGG 


TCT 
ACG 


GCT 
GTA 


GCT 
GTT 


GA 
AG 






607 
607 


8-30 
678-700 


658 
659 c 


5' 
5' 


-GAT 
-GCT 


YTA 
TTT 


GTC 
TGI 


GAT 
GTT 


GAT 
TCW 


GAA 
GGT 


GAA 
TTR 


TT 
AT 






621 
621 


116-138 
443-465 


658 
661 c 


5 ' 
5' 


-GAT 
-GTA 


YTA 
GAA 


GTC 
YTG 


GAT 
TGG 


GAT 
WCG 


GAA 
ATA 


GAA 
RTT 


TT 
RT 






621 
621 


116-138 
678-700 


558 
559 c 


5' 
5' 


-TCI 
-CCG 


TTY 
ACR 


AAR 
GCR 


TAY 
AY I 


GCI 
GTY 


TGG 
TGI 


GT 
CKC 


AT 






665 d 
665 d 


157-176 
1279-1301 


813 
559° 


5' 
5' 


-AAT 
-CCG 


CYG 
ACR 


TYG 
GCR 


AAA 

AYI 


TGC 
GTY 


AYC 
TGI 


ACG 
CKC 


A 
AT 






665 d 
665 d 


687-708 
1279-1301 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the B. distasonis tuf sequence fragment 
(SEQ ID NO. 8) . 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d Sequences from databases . 
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Annex I: Specific and ubiquitous primers for nucl ic acid 
amplification (tuf sequences) (continued). 

~~ Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Sequencing primers (continued) 



558 


5' 


-TCI 


TTY 


AAR 


TAY 


GCI 


TGG 


GT 






665 a 


157-176 


8 15* 5 


5 ' 


-TGG 


TGC 


ATY 


TCK 


Av_K 


VjAC 


rrwp 
1 1 






0 0 -J 


fiftfi -70S 


560 


5' 


-GAY 


TTC 


ATY 


AAR 


AAY 


ATG 


ATY 


AC 




665 a 


289-311 


559 b 


5' 


-CCG 


ACR 


GCR 


AYI 


GTY 


TGI 


CKC 


AT 




665 a 


1279-1301 


653 


5' 


-GAY 


TTC 


ATI 


AAR 


AAY 


ATG 


AT 






665 a 


289-308 


559 b 


5' 


-CCG 


ACR 


GCR 


AYI 


GTY 


TGI 


CKC 


AT 




665 a 


1279-1301 


558 


5' 


-TCI 


TTY 


AAR 


TAY 


GCI 


TGG 


GT 






665 a 


. 157-176 


655 b 


5' 


-CCR 


ATA 


CCI 


CMR 


ATY 


TTG 


TA 






665 a 


754-773 


654 


5 ' 


-TAC 


AAR 


ATY 


KG I 


GGT 


ATY 


GG 






6 65 a 


7 54-773 


559 b 


5' 


-CCG 


ACR 


GCR 


AYI 


GTY 


TGI 


CKC 


AT 




665 a 


1279-1301 


696 


5' 


-ATI 


GGI 


CAY 


RTI 


GAY 


CAY 


GGI 


AAR 


AC 


698 a 


52-77 ^ 


697 b 


5' 


-CCI 


AC I 


GTI 


CKI 


CCR 


CCY 


TCR 


CG 




698 a 


1132-1154 


911 


5' 


-GAC 


GGM 


KKC 


ATG 


CCG 


CAR 


AC 






853 


22-41 


914 b 


5' 


-GAA 


RAG 


CTG 


CGG 


RCG 


RTA 


GTG 






853 


700-720 


912 


5 ' 


-GAC 


GGC 


GKC 


ATG 


CCG 


CAR 


AC 






846 


20-39 


914 b 


5 ' 


-GAA 


RAG 


CTG 


CGG 


RCG 


RTA 


GTG 






846 


692-712 


913 


5 ' 


-GAC 


GGY 


SYC 


ATG 


CCK 


CAG 


AC 






843 


251-270 


915 b 


5 ' 


-AAA 


CGC 


CTG 


AGG 


RCG 


GTA 


GTT 






843 


905-925 


916 


5 ' 


-GCC 


GAG 


CTG 


GCC 


GGC 


TTC 


AG 






846 


422-441 


561 b 


5' 


-AC I 


GTI 


CGG 


CCR 


CCC 


TCA 


CGG 


AT 




619 a 


1483-1505 


664 


5' 


-AAY 


ATG 


ATI 


AC I 


GGI 


GCI 


GCI 


CAR 


ATG GA 


619 a 


604-632 


917 b 


5' 


-TCG 


TGC 


TAC 


CCG 


TYG 


CCG 


CCA 


T 




846 


593-614 



a Sequences from databases . 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex Is Specific and ubicjuitous primers for nucleic acid 
amplification {tuf sequenc s) (continu d) . 



SEQ ID NO. 



Originating DNA fragment 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 




Sequencing primers 


(continued) 










1221 


5' 


-GAY 


ACI 


CCI 


GGI 


CAY 


GTI 


GAY 


TT 




i o *3 na 




J J- *± 




1226 b 


5' 


-GTI 


RMR 


TAI 


CCR 


AAC 


ATY 


TC 






1230 a 


2014 


-2033 


15 


1222 


5' 


-ATY 


GAY 


ACI 


CCI 


GGI 


CAY 


GTI 


GAY 


TT 


I230 a 


289 


-314 


1223 b 


5' 


-AYI 


TCI 


ARR 


TGI 


ARY 


TCR 


CCC 


ATI 


CC 


1230 a 


1408 


-1433 




1224 


5' 


-CCI 


GYI 


HTI 


YTI 


GAR 


CCI 


ATI 


ATG 




1230 a 


1858 


-1881 




1225 b 


5' 


-TAI 


CCR 


AAC 


ATY 


TCI 


SMI 


ARI 


GGI 


AC 


1230 a 


2002 


-2027 


20 






























1227 


5' 


-GTI 


CCI 


YTI 


KCI 


GAR 


ATG 


TTY 


GGI 


TA 


1230 a 


2002 


-2027 




1229 b 


5' 


-TCC 


ATY 


TGI 


GCI 


GCI 


CCI 


GTI 


ATC 


AT 


698 a 


4- 


29 




1228 


5' 


-GTI 


CCI 


YTI 


KCI 


GAR 


ATG 


TTY 


GGI 


TAY GC 


1230 a 


2002 


-2030 


25 


1229 b 


5' 


-TCC 


ATY 


TGI 


GCI 


GCI 


CCI 


GTI 


ATC 


AT 


698 a 


4- 


-29 



30 



a Sequences from databases - 

b These sequences are from the complementary DNA strand of the sequence of the 
originating f ragmen t „giy en in the Sequence Listing. 
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Annex II: Specific and ubi<niitous primers for nucleic acid 
amplification (atpD sequ nces) . 

" Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

v NO. position 



Bacterial species : Streptococcus agalactiae 



627 
625 c 


5' 
5' 


-ATT 
-CGT 


GTC 
TGA 


TAT 
AGA 


AAA 
CAC 


AAT 
GAC 


GGC 
CCA 


GAT 
AAG 


AAG 
TAT 


TC 
CC 


379 
379 


-383 a 
-383 a 


42 
206 


-67 b 
-231 b 


628 
625 c 


5' 
5' 


-AAA 
-CGT 


ATG 
TGA 


GCG ATA. AGT 
AGA CAC GAC 


CAC 
CCA 


AAA 
AAG 


AAG 
TAT 


TA 
CC 


379 
379 


-383 a 
-383 a 


52 
206 


-77 b 
-231 b 


627 
626 c 


5' 
5' 


-ATT 
-TAC 


GTC 
CAC 


TAT 
CTT 


AAA 
TTA 


AAT 
AGT 


GGC 
AAG 


GAT 
GTG 


AAG 
CTA 


TC 
AT 


379 
379 


-383 a 
-383 a 


42 
371 


-67 b 
-396 b 


628 
626 c 


5' 
5' 


-AAA 
-TAC 


ATG 
CAC 


GCG 
CTT 


ATA 
TTA 


AGT 
AGT 


CAC 
AAG 


AAA 
GTG 


AAG 
CTA 


TA 
AT 


379 
379 


-383 a 
-383 a 


52 
371 


-77 b 
-396 b 



Bacterial genus: Candida sp. 

634 5'-AAC ACY GTC AGR RCI ATT GCY ATG GA 460-472, 101-126 d 

474-478 a 

635C 5 ' -AAA CCR GTI ARR GCR ACT CTI GCT CT 460-472, 617-642 d 

474-478 a 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the S. agalactia^ atpD sequence fragment 
I SEQ ID NO. 380) . 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d The nucleotide positions refer to the C. albicans atpD sequence fragment (SEQ 

ID NO. 460) . 
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Ann x II: Specific and ubiquitous primers for nucleic acid 
amplification (atpD sequences) (continued) . 

— — — — — — — — ' — — " Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



20 



25 



30 



35 



40 



45 



50 



562 



Universal primers 

5 '-CAR ATG RAY GAR CCI CCI GGI GYI MGI ATG 



5 - -GGY TGR TAI CCI ACI GCI GAI GGC AT 



564 



5'-TAY GGI CAR ATG AAY GAR CCI CCI GGI AA 



565 c 



5 ' -GGY TGR TAI CCI ACI GCI GAI GGD AT 



243, 244,262, 
264,280,284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 
3 7 9 , 3 3 3 9 

243,244,262, 
264, 280, 284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 
379,393 a 

243,244,262, 
264,280,284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 

379,393 a 

243,244,262, 
264, 280, 284, 
291,297,309, 
311,315,317, 
324,329,332, 
334-336,339, 
342,343,351, 
356,357,364- 
366,370,375, 

379,393 a 



528-557* 



687-712 b 



522-550 b 



687-712 1 



a These sequences were aligned to derive the corresponding primer. 
55 b The nucleotide positions refer to the K. pneumoniae atpD sequence fragment 
(SEQ ID NO. 317) . 

c These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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10 



15 



20 



Annex II: Specific and ubi<juitous primers for nucl ic acid 
amplification (atpD s quencee) (continued) . 



SEQ ID NO. 



640 



641 c 



642 



641 c 



Originating DNA fragment 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



Universal primers (continued) 

5 ' -TCC ATG GTI TOY GGI CAR ATG AA 
5'-TGA TAA CCW ACI GCI GAI GGC ATA CG 

5'-GGC GTI GGI GAR CGI ACI CGT GA 
5'-TGA TAA CCW ACI GCI GAI GGC ATA CG 



248,284, 315, 513-535 b 
317,343, 357, 
366, 370,379, 393 a 

248, 284, 315, 684-7 09 b 
317,343,357, 

366, 370, 379, 393 a 

248,284, 315, 438-460 b 
317,343,357, 
366,370,379,393 a 

24 8,284,315, 6 84-709 b 
317,343,357, 

366, 370,379, 393 a 



25 




Sequencing primers 
















566 


5' 


-TTY 


GGI 


GGI 


GCI 


GGI 


GTI 


GGI 


AAR 


AC 


669 d 


445-470 




567° 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 


30 


566 


5' 


-TTY 


GGI 


GGI 


GCI 


GGI 


GTI 


GGI 


AAR 


AC 


669 d 


445-470 


814 


5' 


-GCI 


GGC 


ACG 


TAC 


ACI 


GCC 


TG 






666 d 


901-920 




568 


5' 


-RTI 


ATI 


GGI 


GCI 


GTI 


RTI 


GAY 


GT 




669 d 


25-47 




567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 


35 






























570 


5' 


-RTI 


RYI 


GGI 


CCI 


GTI 


RTI 


GAY 


GT 




672 d 


31-53 




567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 




572 


5' 


-RTI 


RTI 


GGI 


SCI 


GTI 


RTI 


GA 






669 d 


25-44 


40 


567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 




569 


5' 


-RTI 


RTI 


GGI 


SCI 


GTI 


RTI 


GAT 


AT 




671 d 


31-53 




567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 


45 


571 


5' 


-RTI 


RTI 


GGI 


CCI 


GTI 


RTI 


GAT 


GT 




670 d 


31-53 




567 c 


5' 


-TCR 


TCI 


GCI 


GGI 


ACR 


TAI 


AYI 


GCY 


TG 


669 d 


883-908 



a These sequences were aligned to derive the corresponding primer. 
50 b The nucleotide positions refer to the K. pneumoniae atpD sequence fragment 
(SEQ ID NO. 317) . 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d Sequences from databases . 
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Ann x XX; 



Specific and ubiquitous primers for nucleic 
amplification (atpD s quences) (continued). 



acid 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Sequencing primers (continued) 



700 




5' 


-TIR 


TIG 


AYG 


TCG 


ART 


TCC 


CTC 


ARG 




c c o a 


J o — O X 


567 b 




5 ' 


-TCR 


TCI 


GCI 


GGI 


ALK 




A VT 


rzcv 


i \j 


669 a 


883-908 


568 




5' 


-RTI 


ATI 


GGI 


GCI 


GTI 


RTI 


GAY 


GT 




669 a 


25-47 


573 b 




5' 


-CCI 


CCI 


ACC 


ATR 


TAR 


AAI 


GC 






o o o 




574 




5' 


-ATI 


GCI 


ATG 


GAY 


GGI 


ACI 


GAR 


GG 




666 a 


283-305 


573 b 




5' 


-CCI 


CCI 


ACC 


ATR 


TAR 


AAI 


GC 






c c c a 
bob" 


i 4Ct i ARA 


574 




5' 


-ATI 


GCI 


ATG 


GAY 


GGI 


ACI 


GAR 


GG 




666 a 


283-305 


708 b 




5' 


-TCR 


TCC 


ATI 


CCI 


ARI 


ATI 


GCI 


ATI 


AT 


666 a 




681 




5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR 


AC 




685 


694-716 


682 b 




5 ' 


-GTI 


AC I 


CaGY 


TCY 


TL.K 




rprpT 
111 


PPT 
V- V, X 


CC 


686 


1177-1202 


681 




5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR 


AC 




bob 


ezQ a n 1 
o y ^t — / x o 


683 b 




5 ' 


-GTI 


AC I 


GGI 


TCI 


SWI 


AWR 


TCI 


CCI 




OOJ 


X X O U X -t- \J — > 


681 




5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR 


AC 




685 


694-716 


6 99.1.". 




5 -,- 


-GTI 


-AC I 


GGY~ 


TCY" 


~TYR~ 


-ARR- 


TTI 


CCI 


CC 


686 


1177-1202 


681 




5' 


-GGI 


SSI 


TTY 


GGI 


ISI 


GGI 


AAR 


AC 




685 


694-716 


812 b 




5' 


-GTI 


ACI 


GGI 


TCY 


TYR 


ARR 


TTI 


CCI 


CC 


685 


1180-1205 


1213 




5' 


-AAR 


GGI 


GGI 


ACI 


GCI 


GCI 


ATH 


CCI 


GG 


714 a 


697-722 


1212 b 




5' 


-CCI 


CCI 


RGI 


GGI 


GAI 


ACI 


GCW 


CC 




714 a 


1189-1211 


1203 




5' 


-GGI 


GAR 


MGI 


GGI 


AAY 


GAR 


ATG 






709 a 


724-744 


1207 b 




5' 


-CCI 


TCI 


TCW 


CCI 


GGC 


ATY 


TC 






709 a 


985-1004 


1204 




5' 


-GCI 


AAY 


AAC 


ITC 


IWM 


YAT 


GCC 






709 a 


822-842 


1206 b 




5' 


-CKI 


SRI 


GTI 


GAR 


TCI 


GCC 


A 






709 a 


926-944 


1205 




5' 


-AAY 


ACI 


TCI 


AWY 


ATG 


CCI 


GT 






709 a 


826-845 


1207 b 




5' 


-CCI 


TCI 


TCW 


CCI 


GGC 


ATY 


TC 






709 a 


985-1004 



a Sequences from databases . 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex III: Internal probes for nucleic acid hybridization and 
specific detection of tuf s quences. 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



10 



15 



20 



25 



30 



35 



40 



45 



pacterial species: 

577 5 ' -CAT 
Bacterial species : 

578 5 ' -CAT 
Bacterial species : 



580 
603 
1174 



5 ' -GCT 
5 ' -GGT 
5 ' -GAA 



Bacterial species: 
602 5 ' -AAG 

Bacterial species : 
604 5' -GGT 

Bacter ial species : 
579 5 ' -GAA 



Candida albicans 

GAT TGA ACC ATC CAC CA 

Candida duhlinienais 

GAT TGA AGC TTC CAC CA 

EntorococcuB faecal is 

AAA CCA GCT ACA ATC ACT CCA C 
ATT AAA GAC GAA ACA TC 
CGT GGT GAA GTT CGC 

Entmrococcus faacium 

TTG AAG TTG TTG GTA TT 

Enterococcus gallinarum 

GAT GAA GTA GAA ATC GT 

Escherichia coli 

GGC CGT GCT GGT GAG AA 



407-411 a 406-425 b 



412, 414-415 3 418-437 c 



62-63, 607 a 
62-63, 607 a 
62-63, 607 a 



64, 608 a 



66, 60S! a 



584-608 d 
440-459 d 
398-415 d 



426-445 e 



419-438 1 



78 



503-522 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the C. albicans tuJT(EF-l) sequence fragment 
(SEQ ID NO. 408) . 

c The nucleotide positions refer to the C. dubliniensis tuf(EF-l) sequence 

fragment (SEQ ID NO. 414) . 
d The nucleotide positions refer to the E. faecalis tuf sequence fragment (SEQ 

ID NO. 607) . 

e The nucleotide positions refer to the E . faecium tuf sequence fragment (SEQ 
ID NO. 608) . 

f The nucleotide positions refer to the E. gallinarum tuf sequence fragment 
(SEQ ID NO. 609) . 
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15 



Annex III: Int mal prob s for nucleic acid hybridization and 
specific det ction of tuf s qu nces (continued) . 

— ^ Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

* NO. position 



10 Bacterial species : Haemophilus influenzae 

581 5'-ACA TCG GTG CAT TAT TAC GTG G 610 a 551-572 b 

Ranterial species : Staphylococcus aureus 

584 5'-ACA TGA CAC ATC TAA AAC AA 176-180 c 369-388<* 

585 5' -ACC ACA TAC TGA ATT CAA AG 176-180° 525-544 

586 5'-CAG AAG TAT ACG TAT TAT CA 176-180° 545-564 

5 87 5'-CGT ATT ATC AAA AGA CGA AG 176-180° 555-574° 
20 588 5'-TCT TCT CAA ACT ATC GTC CA 176-180° 593-612° 

Racterial species : Staphylococcus epidexmldls 

589 5'-GCA CGA AAC TTC TAA AAC AA 185,611° 445-464 e 

25 590 5' -TAT ACG TAT TAT CTA AAG AT 185,611° 627-646 e 

591 5'-TCC TGG TTC TAT TAC ACC AC 185,611° 586-605 e 

592 5' -CAA AGC TGA AGT ATA CGT AT 185,611° 616-635 e 

593 5' -TTC ACT AAC TAT CGC CCA CA 185,611° 671-690 e 

30 Bacterial"" speciesV — Staphylococcus haamolytxcus 

594 5' -ATT GGT ATC CAT GAC ACT TC 186,188-190° 437-456 f 

595 5'-TTA AAG CAG ACG TAT ACG TT 186,188-190° 615-634 f 

35 Bacterial species : staphylococcus homlnls 

596 5'-GAA ATT ATT GGT ATC AAA GA 191,193-196° 431-450° 

597 5 ' -ATT GGT ATC AAA GAA ACT TC 191,193-196° 437-456° 

598 5'-AAT TAC ACC TCA CAC AAA AT 191,193-196° 595-614° 

40 , _ 

a Sequences from databases . 

b The nucleotide positions refer to the H. influenzae tuf sequence fragment 
(SEQ ID NO. 610) . 

45 ° These sequences were aligned to derive the corresponding probe. 

d The nucleotide positions refer to the S. aureus tuf sequence fragment (SEQ ID 
NO. 179) . 

e The nucleotide positions refer to the S. epidermidis tuf sequence fragment 
(SEQ ID NO. 611) . 

50 f The nucleotide positions refer to the S. haemolyticus tuf sequence fragment 
(SEQ ID NO. 186) . 

ST The nucleotide positions refer to the S. hominls tuf sequence fragment (SEQ 
ID NO. 191) . 



140 



CA 02307010 2000-OS-19 



Annex III: Internal probes for nucleic acid hybridization and 
specific detection of tuf sequences (continu d) . 

~"~ — ~ ~ — — ~" ~~ - Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10 Bacterial species : 



Staphylococcus saprophyticus 



15 



20 



25 



35 



40 



45 



50 



55 



599 
600 
601 
695 



5'-CGG TGA AGA AAT CGA AAT CA 
5' -ATG CAA GAA GAA TCA AGC AA 
5'-GTT TCA CGT GAT GAT GTA CA 
5'-GTT TCA CGT GAT GAC GTA CA 



Bacteria,! species: 



Streptococcus agalactiae 



582 c 
583 c 
1199 



5 ' -TTT CAA CTT CGT CGT TGA CAC GAA CAG T 



198-200 a 
198-200 a 
198-200 a 
198-200 a 



207-210 a 



-CAA CTG CTT TTT GGA TAT CTT CTT TAA TAC CAA CG 207-210 a 



-GTA TTA AAG AAG ATA TCC AAA AAG C 



Bacterial species: 



Streptococcus pneumoniae 



1201 



5 ' -TCA AAG AAG AAA CTA AAA AAG CTG T 



Bacteria l species : Streptococcus pyogenes 

'30 1200 5' -TCA AAG AAG AAA CTA AAA AAG CTG T 



207-210 a 



971,977, 
979,986 a 



1002 



406-425 b 
431-450 b 
536-555 b 
563-582 b 



404-431 d 
433-467 d 
438-462 d 



513-537 e 



473-497 



Bacterial group: 



Enterococcus casseliflavus-flavescens- 
galllnarum group 



620 
1122 



5 ' -ATT GGT GCA TTG CTA CGT 
5' -TGG TGC ATT GCT ACG TGG 



Bacterial genus: 



Staphylococcus sp. 



605 
606 
1175 
1176 



5 ' -GAA ATG TTC CGT AAA TTA TT 

5' -ATT AGA CTA CGC TGA AGC TG 

5'-GTT ACT GGT GTA GAA ATG TTC 

5' -TAC TGG TGT AGA AAT GTT C 



58,65, 66 a 
58,65,66 a 



176-203 3 
176-203 a 
176-203 a 
176-203 a 



527-544 f 
529-546 f 



403-4229 
420-439^ 
391-4119 
393-4119 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the S. saprophyticus tuf sequence fragment 
(SEQ ID NO. 198) . 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d The nucleotide positions refer to the S. agalactiae tuf sequence fragment 

(SEQ ID NO. 209) . 

e The nucleotide positions refer to the S. pneumoniae tuf sequence fragment 
(SEQ ID NO. 986) . 

f The nucleotide positions refer to the E. flavescens tuf sequence fragment 
(SEQ ID NO. 65) . 

9 The nucleotide positions refer to the S. aureus tuf sequence fragment (SEQ ID 
NO. 179) . 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



Annex IIU Int rnal probes for nucleic acid hybridization and 
specific detection of tuf a quences. 

" Originating DNA fragment 



SEO ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial crenus 






JbVX^GpVOCOCClJS 8p • 






1202 


C t 

D 


-GTG 


TTG 


AAA TGT TCC GTA AAC A 


2 06-231, 971, 466-487 b 
977,979, 982-986 a 


funaal 


species : 






Candida albicans 






1156 


5' 


-GTT 


GAA 


ATG CAT CAC GAA CAA TT 


407-412, 624 a 


680-702 c 


fungal 


species : 






Candida albicans and 


trapicalis 




1160 


5' 


-CGT 


TTC 


TGT TAA AGA AAT TAG AAG 


407-412, 
429,624 a 


748-771 c 


funqal 


species : 






Candida dubllniensis 






1166 
1168 


5' 
5' 


-ACG TTA 
-GAA CAA 


AGA ATG TTT CTG TCA A 
TTG GTT GAA GGT GT 


414-415 a 
414-415 a 


750-771 d 
707-726 d 


funaal. 


species : 






Candida glabrata 






1158 
1159 


5' 
5' 


-AAG 
-TGA 


AGG 
AGG 


TAA TGT CTG TGG T 
TTT GCC AGG TGA 


417 
417 


781-799 
718-735 


funaal 


species : 






Candida krusei 






1161 


5' 


-TCC 


AGG 


TGA TAA CGT TGG 


422 


720-737 


funaal 


species : 






Candida lusitaniae and sniillezmondii 


1162 


5' 


-CAA 


GTC 


CGT GGA AAT GCA 


418,424 a 


682-699 € 


funaal 


species : 






Candida parapsilasis 






1157 


5' 


-AAG 


AAC 


GTT TCA GTT AAG GAA AT 


426 


749-771 


funaal 


species : 






Candida zeylanoides 






1165 


5' 


-GGT 


TTC 


AAC GTG AAG AAC 


432 


713-730 



a These sequences were aligned to derive the corresponding primer. 
b The nucleotide positions refer to the 5. pneumoniae tuf sequence fragment (SEQ ID NO. 
986) . 

c The nucleotide positions refer to the C. alhicans tuf(EF-l) sequence fragment (SEQ ID 
55 NO. 408) . 

d The nucleotide positions refer to the C. dubliniensis tuf(EF-l) sequence fragment { SEQ 
ID NO. 414) . 

e The nucleotide positions refer to the ,C. lusitaniae tuf(EF-l) sequence fragment (SEQ 
ID NO. 424) . 
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Annex III: Internal probes for nucleic acid hybridization and 
sp cific detection of tuf sequences (continued). 

— ' Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



funeral aenus: Candida, sp. 

1163 5 ' -GTT GGT TTC AAC GTT AAG AAC 



1164 



1167 



5 ' -GGT TTC AAC GTC AAG AAC 



5 1 -GTT GGT TTC AAC GT 



407-412,414- 
415,417,418, 

422,429 a 

413,416,420, 
421,424,425, 

426,428, 431 a 

406-426, 428- 

432, 624 a 



728-748* 



740-757° 



728-741* 



a These sequences were aligned to derive the corresponding primer. 

*> The nucleotide positions refer to the C. albicans tuf(EF-l) sequence fragment (SEQ ID 
NO. 408) . 

c The nucleotide positions refer to the C. lusitaniae tuf(EF-l) sequence fragment <SEQ 
ID NO. 424) . 
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Annex XIII: Strategy for the selection of th 
Staphylococcus g nus-specific hybridization 
probe from tuf s quenc s. 







400 




/IOC 




Accession 


s. 


aureus 


G 


TTGAAATGTT 


CCGTAAATTA 


nm* ana 


1 *7 Q 




s. 


aureus 


G 


TTGAAATGTT 


CCGTAAATTA 


^nm ^ ^ 


1/0 




s. 


aureus 


G 


#rwp« K % A RlffllVP 

1 1 UAAAi (*1 1 


wCvTAikATTA 




1 / / 




s . 


aureus 


G 


TTGAAATGTT 


^WS» iiUUiA AJi 




178 




s. 


aureus aureus 


G 


TTGAAATGTT 


CCGTAAATTA 




1DU 




s . 


auricularis 


G 


TAGAAATGTT 


/^rwn » *. ituh r 
tWuiAAAl 1 A 


mm an 


1 Q1 




s. 


capitis capitis 


G 


TAGAAATGTT 


CCGTAAATTA 




1 QO 




M. 


caseolyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


mm a C 7\ 


ioi 

lOJ 




S. 


cohnii 


G 


TAGAAATGTT 


C CGT AAATT A 


TTAbA 


loft 




s. 


epidennidis 


G 


TAGAAATGTT 


C CGT AAATT A 


mm R ps 


1 oc 
lOJ 




s. 


haemolyti cus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


loo 




s. 


haemolyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


i a q 




s. 


haemolyticus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 






s. 


haemolyticus 


\3 


1 AbAAA'ltfl 1 


CCGTAAATTA 


TTAGA 


i fig 
loo 




b . 


homini s 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


196 




S. 


homini s 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


194 




S. 


hominis homini s 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


191 




s. 


hominis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


193 




s. 


hominis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


195 




s. 


lugdunensis 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


197 




s. 


saprophyti cus 


G 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


198 




s. 


s a prophyt-icu s 


G- 


-TAGAAATGTT 


CCGTAAATTA 


TTAGA 


2 00 




s. 


saprbphyfi-cus""'-' 


G" 


TAGAAATGTT 


CCGTAAATTA 


TTAGA 


199 




s. 


sciuri sciuri 


G 


TTGAAATGTT 


CCGTAAATTA 


TTAGA 


201 




s. 


warner i 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


187 




s. 


warner i 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


192 




s. 


warner i 


G 


TAGAAATGTT 


CCGTAAgTTA TTAGA 


202 




s. 


warner i 


G 


TAGAAATGTT 


CCGTAAgTTA 


TTAGA 


203 




B . 


subtilis 


G 


TTGAAATGTT 


CCGTAAgcTt 


CTTGA 




Z99104 


E. 


coli 


G 


TTGAAATGTT 


CCGcAAAcTg 


cTGGA 


78 




L. 


mono cyt ogen es 


G 


TAGAAATGTT 


CCGTAAATTA 


CTAGA 


138* 





Selected sequence for 
genus-specific hybridi- 
zation probe OAAATGTT CCGTAAATTA TT 605 



The sequence numbering refers to the Staphylococcus aureus tuf gene fragment 
(SEQ ID NO. 179) . Nucleotides in capitals are identical to the selected sequence 
or match that sequence. Mismatches are indicated by lower-case letters. 

a The SEQ ID NO. refers to previous patent application WO98/20157. 
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Annex XIV; St rat gy for th si ction of Staphylococcus 
saprophyticus- bp cific and of Staphylococcus 
haemolyticus- specific hybridization prob s 
from tuf sequences. 



SEQ ID 



in 

1U 






339 








383 


NO. : 




b . 


cUHTGUS 


AH 


TtGGTGAAGA 


AcTtGAAATC 


ATcGGTtTaC 


ATGACACaTC 


TAA 


179 




c 

O > 


a ux t? us 


Av? 




AgTtGAAATC 


ATcGGTt TaC 


ATGACACaTC 


TAA 


176 




c 


«3 ursus 


Avj 


i. bVWiwMAWi 


AgTtGAAATC 


ATcGGTtTaC 


ATGACACaTC 


TAA 


177 




o 
o . 


Sit /""i lit* 


Avj 




AgTtGAAATC 


ATcGGTtTaC 


ATGACACaTC 


TAA 


178 


i s 
i j 


c 


aiifdiip anyone 

aureus duieuo 


Alj 


rp*. rurppi ja A A& 


AgTtGAAATC 


ATcGGTtTaC 


ATGACACaTC 


TAA 


180 




. 


a UZ"i CU J. ari S 


Atj 


ivbvl «AA«A 


AcrTtGAAATC 


ATcGGTATga 


AaGACggTTC 


AAA 


181 




c 
o - 


CapXXZxS Cap J. uJL £? 


Alj 


rp*. primp 7A.1V/2fi 


AcrTtGAAATC 


ATcGGTATCC 


AcGAaACTTC 


TAA 


182 




M. 


ca s e o 2 vt i c u s 


Alj 




AgTtGAAATC 


ATTGGTtTaa 


cTGAagaacC 


AAA 


183 




S . 


cohni i 


TV P* 

AG 




AcTtGAAATC 


ATcGGTATgC AaSAsSgaTTG 


AJjf-iA 


164 


on 


S . 


spi&GririicH'S 


AG 


TtGGTGAAGA 


Ay A kUAAAl v» 


ATcGGTATgC AcGAaACTTC 


TAA 


185 




s . 


ha emo 2 y t i c u s 


AG 


TtGGTGAAGA 




ATTGGTATCC 


ATGACACTTC 


TAA 


186 




s . 


/laemo-iyT-^ctis 


AG 


TtGGTGAAGA 




ATTGGTATCC 


ATGACACTTC 


TAA 


189 




e 




AG 


TtGGTGAAGA 


AgTtGAAATC 


ATTGGTATCC 


ATGACACTTC 


TAA 


190 




s. 


jhaemolyticus 


AG 


TtGGTGAAGA 


AgTtGAAATt 


ATTGGTATCa 


AaGAaACTTC 


TAA 


188 


25 


s. 


hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt 


ATTGGTATCa 


AaGAaACTTC 


TAA 


194 




s. 


hominis hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt 


ATTGGTATCa 


AaGAaACTTC 


TAA 


191 




s. 


hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt 


ATTGGTATCa 


AaGAaACTTC 


TAA 


193 




s. 


hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt 


ATTGGTATCa 


AaGAaACTTC 


TAA 


195 




s. 


hominis 


AG 


TtGGTGAAGA 


AgTtGAAATt 


ATTGGTATCa 


AaGAtACTTC 


TAA 


196 


30 


s. 


lugdunensis 


AG 


TCGGTGAAGA 


AgTtGAAATt 


ATTGGTATCC 


AcGAtACTaC 


TAA 


197 




s. 


saprophyticus 


AG 


TCGGTGAAGA 


AATCGAAATC 


ATcGGTATgC 


AaGAagaaTC 


CAA 


198 




s. 


saprophyticus 


AG 


TCGGTGAAGA 


AATCGAAATC 


ATcGGTATgC 


AaGAagaaTC 


CAA 


200 




s. 


saprophy ti cus 


AG 


TCGGTGAAGA 


AATCGAAATC 


ATcGGTATgC 


AaGAagaaTC 


CAA 


199 




s. 


sciuri sciuri 


TG 


TtGGTGAAGA 


AgTtGAAATC 


ATcGGTtTaa 


cTGAagaaTC 


TAA 


201 


35 


s. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC 


ATcGGTtTaC 


ATGACACTTC 


TAA 


187 




s. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC 


ATcGGTtTaC 


ATGACACTTC 


TAA 


192 




s. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC 


ATcGGTtTaC 


ATGACACTTC 


TAA 


202 




s. 


warneri 


AG 


TtGGTGAAGA 


AgTtGAAATC 


ATcGGTtTaC 


ATGACACTTC 


TAA 


203 




B. 


suJbtilis 


AG 


TCGGTGAcGA 


AgTtGAAATC 


ATcGGTcTtC 


AaGAagagag 


AAA 


_ a 


40 


E. 


coli 


AG 


TtGGTGAAGA 


AgTtGAAATC 


gTTGGTATCa AaGAgACTc a 


GAA 


78 




L. 


monocytogenes 


AG 


TtGGTGAcGA 


AgTaGAAgTt 


ATcGGTATCg 


AaGAagaaag 


AAA 


138 b 



Selected sequences for 
species - speci f ic 

45 hybridization probes CGGTGAAGA AATCGAAATC A 599 

ATTGGTATCC ATGACACTTC 594 



The sequence numbering refers to the Staphylococcus aureus tuf gene fragment 
(SEQ ID NO. 179) . Nucleotides in capitals are identical to the selected 
50 sequences or match those sequences. Mismatches are indicated by lower-case 
letters. 

a This sequence was obtained from Genbank accession #Z99104. 

b The SEQ ID NO. refers to previous patent application WO98/20157. 
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Annex XV: Strat gy for th si ction of Staphylococcus 
aureus- specific and of Staphylococcus 
epidexm£dis-sp cific hybridization prob s from 
tuf sequences. 



















SEQ ID 






521 




547 


592 




617 


NO . : 


s. 


aureus 


TACACCACA 


TACTGAATTC 


AAAGCAG . . 


.TTCTTCtCa 


AACTATCGtC 


CACAATT 


179 


s. 


aureus 


TACACCACA 


TACTGAATTC 


AAAGCAG . . 








178 


s. 


aureus 


TACACCACA 


TACTGAATTC 


AAAGCAG . . 


. TTCTTCtCa 


AACTATCG t C 


CACAATT 


176 


s. 


aureus 


TACACCACA 


TACTGAATTC 


AAAGCAG . . 


. TTCTTCtCa 


AACTATCGtC 


CACAATT 


177 


s. 


aureus aureus 


TACACCACA 


TACTGAATTC 


AAAGCAG . . 


. TTCTTCtCa 


AACTATCGtC 


CACAATT 


180 


s. 


auri cu J ari s 


TACACCACA 


cACTaAATTC 


ActGCAG . . 


. TTCTTCtCT 


AACTAcCGtC 


CACAATT 


181 


s. 


capitis capitis 


CACACCACA 


CACTaAATTC 


AAAGCGG . . 


.TTCTTCAgT 


AACTAcCGCC 


CACAATT 


182 


M. 


caseolyticus 


TACtCCACA 


TACTaAATTC 


AAAGCTG . . 


. TTCTTCACT 


AACTAcCGCC 


CtCAGTT 


183 


s. 


cohnii 


TACACCACA 


GACaaAoTTt 


AAAGCGG. . 


.TTCTTCAgT 


AACTATCQCC 


CACAATT 


184 


s. 


epi dermi di s 


TACACCACA 


c AC a aAATTC 


AAAGCTG . . 


. TTCTTCACT 


AACTATCGCC 


CACAATT 


185 


s . 


haemolyticus 


/-% f*% m\ 0*0Hmm 0% m. 

CACACCtCA 


cACaaAATTt 


AAAGCAG . . 


. TTCTTCACa 


AACTATCGtC 


CACAATT 


186 


s. 


haemolyticus 


CACACCtCA 


cACaaAATTt 


AAAGCAG . . 


. TTCTTCACa 


AACTATCGtC 


CACAATT 


189 


s. 


haemolyti cus 


CACACCtCA 


CACaaAATTt 


AAAGCAG . . 


. TTCTTCACa 


AACTATCGtC 


CACAATT 


190 


s. 


haemoly ti cus 


TACACCtCA 


cACaaAATTC 


AAAGCAG . . 


.TTCTTCACT 


AACTATCGtC 


CACAATT 


188 


s. 


hominis 


CACACCtCA 


cACaaAATTC 


AAAGCAG. . 


. TTCTTCACT 


AACTATCGtC 


CACAATT 


195 


s. 


hominis 


TACACCtCA 


CACaaAATTC 


AAAGCAG . . 


.TTCTTCACT 


AACTATCGtC 


CACAATT 


196 


s. 


hominis hominis 


TACACCtCA 


CACaaAATTC 


AAAGCAG . . 


.TTCTTCtCT 


AACTATCGtC 


CACAATT 


191 


s. 


hominis 


TACACCtCA 


cACaaAATTC 


AAAGCAG. . 


.TTCTTCtCT 


AACTATCGtC 


CACAATT 


193 


s. 


hominis 


TACACCtCA 


CACaaAATTC 


AAAGCAG. . 


.TTCTTCtCT 


AACTATCGtC 


CACAATT 


194 


s. 


lugrdunensis 


TACACCtCA 


CACTaAATTt 


AAAGCTG. . 


.TTCTTCtCa 


AACTACCGCC 


CACAATT 


197 


s. 


saprophyticus 


TACACCACA 


TACaaAATTC 


AAAGCGG. . 


.TTCTTCACT 


AACTAcCGCC 


CACAATT 


198 


s. 


saprophyticus 


TACACCACA 


TACaaAATTC 


AAAGCGG . . 


. TTCTTCACT 


AACTAcCGCC 


CACAATT 


199 


s. 


~saprophyt icus 


TACACCACA 


TACaaAATTC 


AAAGCGG . . 


.TTCTTCACT 


AACTACCGCC 


CACAATT 


200 


s. 


sciuri sciuri 


CACACCtCA 


CACTaAATTC 


AAAGCTG. . 


.TTCTTCACa 


AACTACCGCC 


CACAATT 


201 


s. 


warneri 


TACACCACA 


TACaaAATTC 


AAAGCGG . . 








192 


s. 


warner i 


TACACCACA 


TACaaAATTC 


AAAGCGG . . 


. TTCTTCAgT 


AACTAcCGCC 


CACAATT 


187 


s. 


warneri 


TACACCACA 


TACaaAATTC 


AAAGCGG . . 


.TTCTTCAgT 


AACTACCGCC 


CACAATT 


202 


s. 


warneri 


TACACCACA 


TACaaAATTC 


AAAGCGG. . 


.TTCTTCAgT 


AACTACCGCC 


CACAATT 


203 


B. 


subtilis 


CACtCCACA 


c Age aAATTC 


AAAGCTG . . 


. TTCTTCtCT 


AACTAcCGtC 


CtCAGTT 




E. 


coli 


CAAgCCgCA 


cACcaAgTTC 


gAAtCTG . . 


.TTCTTCAaa 


ggCTAcCGtC 


CgCAGTT 


7B 


L. 


monocytogenes 


TACtCCACA 


CACTaACTTC 


AAAGCTG . . 


. TTCTTCAaa 


AACTACCGCC 


CACAATT 


138 b 



Selected sequences 
for species-specific 
hybridization 

probes ACCACA TACTGAATTC AAAG 585 

TTCACT AACTATCGCC CACA 593 

The sequence numbering refers to the Staphylococcus aureus tuf gene fragment (SEQ ID NO. 
179) . Nucleotides in capitals are identical to the selected sequences or match those 
sequences. Mismatches are indicated by lower-case letters. "~ u indicate incomplete 
sequence data. Dots indicate gaps in the sequences displayed. 

a This sequence was obtained from Genbank accession #Z99104. 

b The SEQ ID NO. refers to previous patent application WO98/20157. 
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Annex XVI: Strategy for the s lection of the 
Staphylococcus hominis- specific hybridization 
probe from tuf s qu nces. 

5 



10 



15 



20 



25 



30 



35 



40 







3 58 




383 


Qirn Tfl "NTH 


s. 


aureus 


ATC 


ATcGGTtTac AtGAcACaTC 


TAA 


1 7Q 


s. 


aureus 


ATC 


ATcGGTtTac AtGAcACaTC 


TAA 


1 

-L / O 


s. 


aureus 


ATC 


ATcGGTtTac AtGAcACaTC 


TAA 


177 


s. 


aureus 


ATC 


ATcGGTtTac AtGAcACaTC 


TAA 


1 7ft 


s. 


aureus aureus 


ATC 


ATcGGTtTac AtGAcACaTC 


TAA 


180 


s. 


auricuiaris 


ATC 


ATcGGTATgA AAGAcggTTC AAA 


181 


s. 


capitis capitis 


ATC 


ATcGGTATCc Ac GAAACTTC 


TAA 


182 


Af. 


caseolyticus 


ATC 


ATTGGTtTaA ctGAAgaacC AAA 


IOj 


s. 


cohnii 


ATC 


ATcGGTATgc AAGAAgaTTC 


CAA 


1 PA 

J. o*± 


s. 


epidermidis 


ATC 


ATcGGTAToc AcGAAAC-TTC- 


TAA 




s. 


ha emolytricus 


ATC 


ATTGGTATCc At GAC ACTTC 


TAA 


1 Q£ 
J.OD 


s. 


haemolyticus 


ATC 


ATTGGTATCc At G Ac ACTTC 


TAA 




s. 


haemolyticus 


ATC 


ATTGGTATCc At GAc ACTTC 


TAA 




s. 


haemolyticus 


Al I 


ATTGGTATCA AAGAAACTTC 


TAA 


188 


s. 


hominis 


ATT 


ATTGGTATCA AAGAtACTTC 


TAA 


196 


s. 


hominis 


ATT 


ATTGGTATCA AAGAAACTTC 


TAA 


194 


s. 


hominis hominis 


ATT 


ATTGGTATCA AAGAAACTTC 


TAA 


191 


s. 


hominis 


ATT 


ATTGGTATCA AAGAAACTTC 


TAA 


193 


s. 


hominis 


ATT 


ATTGGTATCA AAGAAACTTC 


TAA 


195 


S- 


lugdunensis 


ATT 


ATTGGTATCc Ac G At ACT aC 


TAA 


197 


s. 


saprophyticus 


- —-ATC- 


-ATcGGTATgc AAGAAgaaTC 


CAA 


198 


s. 


saprophyti cus 


ATC 


ATcGGTATgc AAGAAgaaTC 


CAA 


200 


s. 


saprophyt i cus 


ATC 


ATcGGTATgc AAGAAgaaTC 


CAA 


199 


5. 


sciuri sciuri 


ATC 


ATcGGTtTaA ctGAAgaaTC TAA 


201 


s. 


warneri 


ATC 


ATcGGTtTac At GAc ACTTC 


TAA 


187 


s. 


warner i 


ATC 


ATcGGTtTac At GAc ACTTC 


TAA 


192 


s. 


warneri 


ATC 


ATcGGTtTac AtGAcACTTC 


TAA 


202 


s. 


warneri 


ATC 


ATcGGTtTac AtGAcACTTC 


TAA 


203 


B . 


subtilis 


ATC 


ATcGGTcTtc AAGAAgagag 


AAA 


_a 


E. 


coli 


ATC 


gTTGGTATCA AAGAgACTca 


GAA 


78 


L . 


monocytogenes 


GTT 


ATcGGTATCg AAGAAgaaag 


AAA 


138 b 



Selected sequence for 
species-specific 
hybridization probe 



ATTGGTATCA AAGAAACTTC 



597 



45 



50 



The sequence numbering refers to the Staphylococcus aureus tuf gene fragment 
(SEQ ID NO. 179) . Nucleotides in capitals are identical to the selected 
sequences or match those sequences . Mismatches are indicated by lower-case 
letters. Dots indicate gaps in the sequences displayed. 

a This sequence was obtained from Genbank accession #Z99104. 

b The SEQ ID NO. refers to previous patent application WO98/20157. 
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Annex XXX: 



Specific and ubiquitous prim rs for nucleic acid 
amplification (recA seguenc s) . 



SEQ ID NO. 


Nucleotide sequence 




Originating 

SEQ ID 
NO. 


DNA fragment 

*■» *-AV— X CT \J W. J. UC 

position 




Universal primers (recA) 








919 


5'-GGI CCI GAR TCI TMI GGI AAR 


AC 


918 a 


437-459 


920 b 


5 ' -TCI CCV ATI TCI CCI TCI AIY 


TC 


918 a 


701-723 


921 


5'-TIY RTI GAY GCI GAR CAI GC 




918 a 


515-534 


922 b 


5' -TAR AAY TTI ARI GCI YKI CCI 


CC 


918 a 


872-894 




Universal primers {radSl) 








935 


5' -GGI AAR WSI CAR YTI TGY CAY 


AC 


939 a 


568-590 


936 b 


5 '-TCI SIY TCI GGI ARR CAI GG 




939 a 


1126-1145 




Universal primers (dmcl) 








937 


5 '-ATI ACI GAR GYI TTY GGI GAR 


TT 


940 a 


1038-1060 


938 b 


5'-CYI GTI GYI SWI GCR TGI GC 




940 a 


1554-1573 



10 



15 



20 



25 



30 a Sequences from databases . 

b These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 



35 
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Annex XXIX 


s Specific and ubiquitous primers 
amplification (speA s guences) . 


for 


nucleic acid 


5 




Originating DNA fragment 




SEQ ID NO. 


Nucleotide sequence 


SEQ ID 
NO . 


Nucleotide 
posi tion 


10 


Bacterial 


species : Streptococcus pyogenes 






15 


994 
995^ 

996 
997 b 


5 ' -TGG ACT AAC AAT CTC GCA AGA GG 
5 ' -ACA TTC TCG TGA GTA ACA GGG T 

5 ' -ACA AAT CAT GAA GGG AAT CAT TTA G 
5 ' -CTA ATT CTT GAG CAG TTA CCA TT 


993 a 
993a 

993 a 
993 a 


60-82 
173-194 

400-424 
504-526 


20 


998 
997b 


5 ' -GGA GGG GTA ACA AAT CAT GAA GG 

5 ' -CTA ATT CTT SAG GAG TTA CCA TT 


993 a 
993 a 


391-413 
504-526 



a Sequence from databases . 

k These sequences are from the complementary DNA strand of the sequence of the 
25 originating fragment given in the Sequence Listing. 
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Annex XXXIX: Sp cific and ubiquitous primers for nucleic acid 
amplification {atx s quences) • 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10 Toxin gene : atx x 

1081 5' -ATG TCA GAG GGA TAG ATC CA 1076 a 233-252 
1080 b 5' -TAT AGC TAC TGT CAC CAG ACA ATG T 1076 a 394-418 

15 Toxin gene: atx 3 

1078 5'-AGT TCT GCG TTT TGT CAC TGT C 1077 a 546-567 

1079 b 5'-CGG AAG CAC ATT GCT GAT T 1077 a 687-705 

20 Toxin genes: stx x and atx a 

1082 5'-TTG ARC RAA ATA ATT TAT ATG TG 1076 a 287-3 09 
1083° 5'-TGA TGA TGR CAA TTC AGT AT 1076 a 790-809 

25 — 



Sequences from databases . 

These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Ann x XXXIII: Hoi cular beacon internal probeB for hybridization 
and specific detection of toxin seguenc s. 



5 Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 3 SEQ ID Nucleotide 

NO. position 

10 

Toxin gene: stx t 

1084 5 ' - CCA CGC CGC TTT GCT GAT TTT TCA CAT 

GTT ACC GCG TGG 1076 b 346-372 

15 

Toxin gene: stx 3 

1085 5 ' - CCA CGC CAC TGT CTG AAA CTG CTC CTG 

T G CGT GG 1077 b 617-638 

20 

a Underlined nucleotides indicate the molecular beacon's stem. 
b Sequences from databases . 
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Annex XXX rV: Specific and ubiquitous primers for nucleic acid 
amplification (van sequences). 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Resistance 


crene 


: 




vanA 
















1086 


5 


' -CTA 


V_ 1 v_ 


CCCZ 


CCT 


TTT 


car 1 


TT 






104y-1037 c * 


513- 


-532 13 


1087 c 


5 


' -CTC 


ACA 


GCC 


CGA 


AAC 


AGC 


CT 






1049-1057 a 


699- 


-718 b 


1086 


5 


• -CTA 


CTC 


CCG 


CCT 


TTT 


GGG 


TT 






1049-1057 a 


513- 


-532 b 


1088 c 


5 


' -TGC 


CGT 


TTC 


CTG 


TAT 


CCG 


TC 






1049-1057 a 


885- 


■904 b 


1086 


5 


' -CTA 


CTC 


CCG 


CCT 


TTT 


GGG 


TT 






1049-1057 a 


513- 


•532 b 


1089° 


5 


' -ATC 


CAC 


ACG 


GGC 


TAG 


ACC 


TC 






1049-1057 a 


933- 


•952 b 


1090 


5 


' -AAT 


AGC 


GCG 


GAC 


GAA 


TTG 


GAC 






1049-1057 a 


629- 


•649 b 


1091 c 


5 


' -AAC 


GCG 


GCA 


CTG 


TTT 


CCC 


AA 






1049-1057 a 


734- 


■753 b 


1090 


5 


' -AAT 


AGC 


GCG 


GAC 


GAA 


TTG 


GAC 






1049-1057 a 


629- 


•649 b 


1089 c 


5 


1 -ATC 


CAC 


ACG 


GGC 


TAG 


ACC 


TC 






1049-1057 a 


933- 


■952 b 


1092 


5 


1 -TCG 


GCA 


AGA 


CAA 


TAT 


GAC 


AGC 






1049-1057 a 


662- 


682 b 


1088 c 


5 


• -TGC 


CGT 


TTC 


CTG 


TAT 


CCG 


TC 






1049-1057 a 


885- 


904 b 


Resistance 


qen.e 






vauB 
















1095 


5 


'^CGA 


TAG AAG 


CAG 


CAG 


GAC 


AA 






1117 d 


473 


-492 


1096 c 


5 ' -CTG 


ATG 


GAT 


GCG 


GAA 


GAT 


AC 






1117 d 


611 


-630 


Resistance 


gene 






v&nAB 














1112 


5 


' -GGC 


TGY 


GAT 


ATT 


CAA 


AGC 


TC 






1049-1057, 1117 a 


437- 


•456 b 


1113 c 


5 


' -ACC 


GAC 


CTC 


ACA 


GCC 


CGA 


AA 






1049-1057, 1117 a 


705- 


•724 b 


1112 


5 


' -GGC 


TGY 


GAT 


ATT 


CAA 


AGC 


TC 






1049-1057, 1117 a 


437- 


456 b 


1114 c 


5 


' -TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 






1049-1057, 1117 a 


817- 


•837 b 


1115 


5 


' -TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT 


GHG 


CG 


1049-1057, 1117 a 


705- 


730 b 


1114 c 


5 


' -TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 






1049-1057, 1117 a 


817- 


837 b 


1116 


5 


' -TTT 


CGG 


GCT 


GTG 


AGG 


TCG 


GBT 


GHG 


CGG 


1049-1057 / 1117 a 


705- 


731 b 


1114 c 


5 


' -TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 






1049-1057, 1117 a 


817- 


837 b 


1112 


5 


' -GGC 


TGY 


GAT 


ATT 


CAA 


AGC 


TC 






1049-1057, 1117 a 


437- 


456 b 


1118 c 


5' 


' -TTT 


TCW 


GAG 


CCT 


TTT 


TCC 


GGC 


TCG 




1049-1057, 1117 a 


817- 


840 b 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the vanA sequence fragment (SEQ ID NO. 1051). 

c These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
d Sequences from databases . 
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Annex XXXIV: Specific and ubiquitous primers for nucleic acid 
amplification {van s quenc s) (continued). 



SEQ ID NO. Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



10 Resistance gene ; 



vanAB (continued) 



15 



20 



25 



30 



35 



40 



1115 5 '-TTT CGG GCT GTG AGG TCG GBT GHG CG 
1118 c 5' -TTT TCW GAG CCT TTT TCC GGC TCG 

1116 5 '-TTT CGG GCT GTG AGG TCG GBT GHG CGG 
1118 c 5 '-TTT TCW GAG CCT TTT TCC GGC TCG 

1119 5 '-TTT CGG GCT GTG AGG TCG GBT GHG C 
1118 c 5 '-TTT TCW GAG CCT TTT TCC GGC TCG 

1120 5' -TTT CGG GCT GTG AGG TCG GBT GHG 
1118 c 5' -TTT TCW GAG CCT TTT TCC GGC TCG 

1121 5'-TGT TTG WAT TGT CYG GYA TCC C 
llll c 5'-CTT TTT CCG GCT CGW YTT CCT GAT G 

1112 5' -GGC TGY GAT ATT CAA AGC TC 

1111° 5'-CTT TTT CCG GCT CGW YTT CCT GAT G 

1123 5 1 -TTT CGG GCT GTG AGG TCG GBT G 

1.11 l c 5'-CTT TTT CCG GCT CGW YTT CCT GAT G 

1112 5 ' -GGC TGY GAT ATT CAA AGC TC 

1124° 5 ' -GAT TTG RTC CAC YTC GCC RAC A 



Resistance gene: 



van CI 



1103 5'-ATC CCG CTA TGA AAA CGA TC 

1104 c 5'-GGA TCA ACA CAG TAG AAC CG 



1049-1057, 1117 a 705-730 b 

1049-1057, 1117 a 817-840 b 

1049-1057, 1117 a 705-731 b 

1049-1057, 1117 a 817-840 b 

1049-1057, 1117 a 705-729 b 

1049-1057, 1117 a 817-840 b 

1G4-9-1057, 1117 a 705-728 b 

1049-1057, 1117 a 817-840 b 

1049-1057, 1117 a 408-429 b 

1049-1057, 1117 a 806-830 b 

1049-1057, 1117 a 437-456 b 

1049-1057, 1117 a 806-830 b 

1049-1057, 1117 a 705-726 b 

1049-1057, 1117 a 806-83 0 b 

1049-1057 , 1117 a 437-456 b 

1049-1057 ,1117 s 757-778 b 



1058-1059 a 
1058-1059 a 



519-538 d 
678-697 d 



Resistance gene: 



vanCl, vanC2, vanC3 



45 



50 



1097 


5' 


-TCY 


TCA 


AAA 


GGG 


ATC 


ACW 


AAA 


GTM 


AC 


1058 


-1066 a 


607 


-632 d 


1098 c 


5' 


-TCT 


TCA 


AAA 


TCG 


AAA 


AAG 


CCG 


TC 




1058 


-1066 a 


787 


-809 d 


1099 


5' 


-TCA 


AAA 


GGG 


ATC 


ACW 


AAA 


GTM 


AC 




1058 


-1066 a 


610 


-632 d 


1100 c 


5 ' 


-GTA 


AAK 


CCC 


GGC 


ATR 


GTR 


TTG 


ATT 


TC 


1058 


-1066 a 


976- 


1001 d 


1101 


5' 


-GAC 


GGY 


TTT 


TTY 


GAT 


TTT 


GAA 


GA 




1058 


-1066 a 


787 


-809 d 


1102 c 


5' 


-AAA 


AAR 


TCG 


ATK 


CGA 


GCM 


AGA 


CC 




1058 


-1066 a 


922 


-944 d 



Resistance gene: 



vanC2-vanC3 



55 



60 



1105 5'-CTC CTA CGA TTC TCT TGA YAA ATC A 

1106° 5' -CAA CCG ATC TCA ACA CCG GCA AT 



1060-1066, 1140 a 
1060-1066, 1140 a 



487-511 e 
690-712 e 



These sequences were aligned to derive the corresponding primer. 

The nucleotide positions refer to the vanA sequence fragment (SEQ ID NO. 1051) . 

These sequences are from the complementary DNA s Grand of the sequence of the originating fragment 
given in the Sequence Listing. 

The nucleotide positions refer to the vanCl sequence fragment (SEQ ID NO. 1058) . 
The nucleotide positions refer to the vanC2 sequence fragment (SEQ ID NO. 1140) . 
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Annex XXXIV: Sp cific and ubiquitous primers for nucleic acid 
amplification (van sequences) (continued) . 



Originating DNA fragment 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



10 












Sequencing primers 


( vanAB) 






1112 
llll b 


5 ' 
5' 


-GGC 
-CTT 


TGY 
TTT 


GAT 
CCG 


ATT CAA AGC TC 

GCT CGW YTT CCT GAT G 


1139 a 
1139 a 


737-756 
1106-1130 


15 












Sequencing primers 


( vanA, vanX, 


van IT) 




1150 
1151 b 


5 ' 
5' 


-TGA 
-TGC 


TAA 
TGT 


TCA 
CAT 


CAC CGC ATA CG 
ATT GTC TTG CC 


U41 a 
1141 a 


860-879 
1549-1568 


20 


1152 
1153 b 


5 ' 
5' 


-ATA 
-CTC 


AAG 
GTA 


ATG 
TGT 


ata rifir* err* tv 
CCC TAC AAT GC 


1141 a 
1141 a 


1422-1441 
2114-2133 


25 


1154 
1155 b 


5' 
5' 


-GTT 
-CAG 


TGA 
TGC 


AGC 
TTC 


ATA TAG CCT CG 
ATT AAC GTA GTC 


1141 a 
1141 a 


2520-2539 
3089-3109 












Sequencing primers 


(van CI) 




30 


1110 
1109 b 


5' 
5' 


-ACG 
-ACA 


AGA 
TCG 


AAG 
TGA 


ACA ACA GGA AGA CC 
TCG CTA AAA GGA GC 


1138 a 
1138 a 


122-144 
1315-1337 












Sequencing primers 


( vanC2-vanC3 ) 




35 


1108 
1107 b 


5' 
5 ' 


-GTA 
-CTC 


AGA 
ATT 


ATC 
TGA 


GGA AAA GCG GAA GG 
CTT CCT CCT TTG CT 


1140 a 
1140 a 


1-23 
1064-1086 



Sequences from databases . 



These sequences are from the complementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 
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Annex XXXV: 



Internal prob s for nucleic acid hybridization and 
specific detection of van sequences. 



10 



SEQ ID NO. 



Nucleotide sequence 



Originating DNA fragment 



SEQ ID 
NO. 



Nucleotide 
position 



Resistance gene: vanA 

1170 5' -ACG AAT TGG ACT ACG CAA TT 

15 Resistance gene: vanB 

1171 5' -ACG AGG ATG ATT TGA TTG TC 



1049-1057 a 639-658° 



1117 c 



560-579 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the vanA sequence fragment (SEQ ID NO. 
1051) . 

c Sequences from databases . 
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Annex XXXVT: 



Specific and ubiquitous primers for nucleic acid 
amplification (pbp sequences). 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Resistance aene ; 



pi>pla 



1129 
1131 c 

1130 
1131 



1125 
1126 c 

1125 
1128 c 

1127 
1126 c 



1142 
1143° 

1142 
1145 c 

1144 
1143 c 



5' -ATG ATG 
5 ' -CAT CTG 

5'-GAC TAT 
5 '-CAT CTG 



ACC GAM ATG ATG AAA AC 
GAG CTA CRT ARC CAG T 

CCA AGC ATG CAT TAT G 
GAG CTA CRT ARC CAG T 



1004-1018 a 
1004-1018 a 

1004-1018 a 
1004-1018 a 



Sequencing primers (pibpla) 



5 ' -ACT CAC 
5 ' -TTA TGG 

5 '-ACT CAC 
5'-GAC GAC 

5 ' -KCA AAY 
5 ' -TTA TGG 



AAC TGG GAT GGA TG 
TTG TGC TGG TTG AGG 

AAC TGG GAT GGA TG 
YTT ATK GAT ATA CA 

GCC ATT TCA AGT AA 
TTG TGC TGG TTG AGG 



1169 d 
1169 d 

1169 d 
1169 d 

1169 d 
1169 d 



Sequencing primers (pJbp2b) 



Nucleotide 
position 



681-703 b 
816-837 b 

456-477 b 
816-837 b 



873-892 
2140-2160 

873-892 
1499-1518 

1384-1403 
2140-2160 



5' 


-GAT 


CCT 


CTA 


AAT 


GAT 


TCT 


CAG 


GTG 


G 


1172 d 


1- 


-25 


5 ■ 


-CAA 


TTA 


GCT 


TAG 


CAA 


TAG 


GTG 


TTG 


G 


1172 d 


1481 


-1505 


5' 


-GAT 


CCT 


CTA 


AAT 


GAT 


TCT 


CAG 


GTG 


G 


1172 d 


1- 


•25 


5' 


-AAC 


ATA 


TTK 


GGT 


TGA 


TAG 


GT 






1172 d 


793 


-812 


5' 


-TGT 


YTT 


CCA 


AGG 


TTC 


AGC 


TC 






1172 d 


657 


-676 


5' 


-CAA 


TTA 


GCT 


TAG 


CAA 


TAG 


GTG 


TTG 


G 


1172 d 


1481 


-1505 



Sequencing primers (pbp2x) 



1146 
1147 c 

1146 
1149° 

1148 
1147 c 



5' 


-GGG 


ATT 


ACC 


TAT 


GCC 


AAT 


ATG 


AT 


1173 d 


219 


-241 


5' 


-AGC 


TGT 


GTT 


AGC 


VCG 


AAC 


ATC 


TTG 


1173 d 


1938 


-1961 


5' 


-GGG 


ATT 


ACC 


TAT 


GCC 


AAT 


ATG 


AT 


1173 d 


219 


-241 


5' 


-TCC 


YAC 


WAT 


TTC 


TTT 


TTG 


WG 




1173 d 


1231 


-1250 


5' 


-GAC 


TTT 


GTT 


TGG 


CGT 


GAT 


AT 




1173 d 


711- 


-730 


5' 


-AGC 


TGT 


GTT 


AGC 


VCG 


AAC 


ATC 


TTG 


1173 d 


1938- 


-1961 



« These sequences were aligned to derive the corresponding primer. 

1004) nUCle ° tide positions refer to phpla sequence fragment (SEQ ID NO. 

C ^?^ff^ en ? eS are fr - 0m th F ^Plementary DNA strand of the sequence of the 
originating fragment given in the Sequence Listing. 

a Sequences from databases . 
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Annex XXXVII: Internal prob s for nucleic acid hybridization 
and specific detection of pbp sequences. 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Resistance aene : pJbpla 



1132 


5' 


-AGT 


GAA 


AAR 


ATG 


GCT 


GCT 


GC 






1004 


-1018 a 


531 


-550 b 


1133 


5' 


-CAT 


CAA 


GAA 


CAC 


TGG 


CTA 


YGT 


AG 




1004 


-1018 a 


806 


-828 b 


1134 


5 ' 


-CTA 


GAT 


AGA 


GCT 


AAA 


ACC 


TTC 


CT 




1004 


-1018 a 


417 


-439 b 


1135 


5' 


-CAT 


TAT 


GCA 


AAC 


GCC 


ATT 


TCA 


AG 




1004 


~1018 a 


471 


-493 b 


1192 


5' 


-GGT 


AAA 


ACA 


GGA 


ACC 


TCT 


AAC 


T 




1004 


-1018 a 


759 


-780 b 


1193 


5' 


-GGT 


AAG 


ACA 


GGT 


ACT 


TCT 


AAC 


T 




1004 


-1018 a 


759 


-780 b 


1194 


5' 


-CAT 


TTC 


AAG 


TAA 


TAC 


AAC 


AGA 


ATC 




lG*04~lGl*o a 


485 


-508 b 


1195 


5' 


-CAT 


TTC 


AAG 


TAA 


CAC 


AAC 


TGA 


ATC 




1004- 


-1018 a 


485 


-508 b 


1196 


5' 


-GCC 


ATT 


TCA 


AGT 


AAT 


ACA 


ACA 


GAA 




1004 


-1018 a 


483 


-506 b 


1197 


5' 


-CAA 


ACG 


CCA 


TTT 


CAA 


GTA 


ATA 


CAA 


C 


1004- 


-1018 a 


478- 


-502 b 


1094 


5' 


-GGT 


AAA 


ACA 


GGT 


ACT 


TCT 


AAC 


TA 




1004- 


-1018 a 


759- 


-781 b 


1214 


5' 


-GGT 


AAA 


ACA 


GGT 


ACC 


TCT 


AAC 


TA 




1004- 


-1018 a 


759- 


-781 b 


1216 


5' 


-GGT 


AAG 


ACT 


GGT 


ACA 


TCA 


AAC 


TA 




1004- 


-1018 a 


759- 


-781 b 


1217 


5' 


-CAA 


ATG 


CCA 


TTT 


CAA 


GTA 


ACA 


CAA 


C 


1004- 


-1018 a 


478- 


-502 b 


1218 


5' 


-CAA 


ACG 


CCA 


TTT 


CAA 


GTA 


ACA 


CAA 


C 


1004- 


-1018 a 


478- 


-502 b 


1219 


5' 


-CAA 


ATG 


CTA 


TTT 


CAA 


GTA 


ATA 


CAA 


C 


1004- 


-1018 a 


478-502 b 


1220 


5' 


-CAA 


ACG 


CCA 


TTT 


CAA 


GTA 


ATA 


CGA 


C 


1004-1018 a 


478- 


-502 b 



a These sequences were aligned to derive the corresponding primer. 

b The nucleotide positions refer to the pbpla. sequence fragment (SEQ ID NO 
1004) . 
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Annex XXXIX: Internal probe for nucleic acid hybridization and 

specific detection of mac A. 



10 









Originating 


DNA fragment 


SEQ ID NO. 


Nucleotide 


sequence 


SEQ ID 
NO. 


Nucleotide 
position 


Resistance 


qene: 


mecA 






1177 


5'-GCT CAA 


CAA GTT CCA GAT TA 


1178 a 


1313-1332 



15 

a Sequence from databases 
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Annex XL: 



Sp cific and ubiquitous primers for nucleic acid 
amplification (hexA sequences). 



Originating DNA fragment 



SEQ ID NO. 



Nucleotide sequence 



SEQ ID 
NO. 



Nucleotide 
position 



Bacterial species ; 



Streptococcus pneumonias 



1179 
1181 b 



1179 
1182 b 



5 ' -ATT TGG TGA CGG GTG ACT TT 
5'-AGC AGC TTA CTA GAT GCC GT 



Sequencing printers 



5 ' -ATT TGG TGA CGG GTG ACT TT 
5 ' -AAC TGC AAG AGA TCC TTT GG 



1183 a 
1183-1191° 



1183 a 
1183 a 



431-450 
652-671 d 



431-450 
1045-1064 



a Sequences from databases . 

b These sequences are from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
c These sequences were aligned to derive the corresponding primer. 

d Jhe nucleotide positions refer to the hexA sequence fragment (SEQ ID NO. 
1183 ) . 
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Annex XLIs Internal probe for nucleic acid hybridization and 
specific detection of hexA seguenc s. 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 



Bacterial species: Streptococcus pneumoniae 

118 ° a 5'-TCC ACC GTT GCC AAT CGC A 1183-1191 b 629-647<= 



a This sequences is from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing. 
b These sequences were aligned to derive the corresponding primer. 
C ^he nucleotide Positions refer to the hexA sequence fragment { SEQ ID NO. 
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Annex XLIII: Specific and ubiquitous primers for nucleic acid 
amplification (pep sequence) . 



5 Originating DNA fragment 



10 



15 



SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

Bacterial species: streptococcus pyogenes 

1211 5'-ATT CTT GTA ACA GGC TTT GAT CCC 1215 a 291-314 

1210° 5'-ACC AGC TTG CCC AAT ACA AAG G 1215 a 473-494 



a Sequences from databases . 

k These sequences are from the complementary DNA strand of the sequence of the 
20 originating fragment given in the Sequence Listing. 



191 



CA 02307010 2000-05-19 



10 



20 



Annex XL IV: Specific and ubiquitous primers for nucleic acid 
amplification (S. saprophyticus unknown gene 
sequences) . 



~" Originating DNA fragment 

SEQ ID NO. Nucleotide sequence SEQ ID Nucleotide 

NO. position 

Bacterial species : Staphylococcus saprophyticus 



1208 5'-TCA AAA AGT TTT CTA AAA AAT TTA C 74,1093, 169-193° 

15 1198 b 

1209 a 5'-ACG GGC GTC CAC AAA ATC AAT AGG A 74,1093, 355-379° 

1198 b 



a This sequence is from the complementary DNA strand of the sequence of the 

originating fragment given in the Sequence Listing, 
k These sequences were aligned to derive the corresponding primer. 
25 c The nucleotide positions refer to the S. saprophyticus unknown gene sequence 

fragment { SEQ ID NO. 1198). 
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Annex XLV: Molecular beacon internal probes for hybridization 
and specific detection of antibiotic r sistance 
gene sequ nces. 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence 3 SEQ ID Nucleotide 

NO. position 



Resistance 


aene : 




jnecA 


















1231 5 


T££ TCG 


CCG 


AAG ATA 


AAA 


AAG 


AAC 


CTC 


TGC 


1178° 


1291- 


1315 


Resistance 


aene : 




vanA 


















1239 5 


' -GCG AGC 


GCA 


GAC CTT 


TCA 


GCA 


GAG 


GAG 


GCT 


1051 


860- 


880 


1240 5 


' -GCG AGC 
TCG £2C 


CGG 
QQ 


CAA GAC 


AAT 


ATG 


ACA 


GCA 


AAA 


1051 


663- 


688 


Resists? 


aene : 




vanB 


















1241 5 


'-GCG AGC 
CTC GC 


GGG 


GAA CGA 


GGA 


TGA 


TTT 


GAT 


TGG 


1117 


555- 


577 



a Underlined nucleotides indicate the molecular beacon's stem, 
b Sequence from databases. 
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Annex XLVI: Molecular beacon internal probe for hybridization 
and specific detection of an unknown S. aureus g ne 
sequence • 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence 3 SEQ ID Nucleotide 

NO. position 

10 

Bacterial species : S. aureus 

1232 5'-GQh QQQ QCG CGA TTT TAT AAA TGA ATG TTG 1244 53-80 

15 ATA ACC GGC TCC 

a Underlined nucleotides indicate the molecular beacon's stem. 
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Annex XLVII i Molecular beacon internal probes for hybridization 
and specific det ction of tuf sequ nces. 



Originating DNA fragment 

SEQ ID NO. Nucleotide sequence 3 SEQ id Nucleotide 

NO. position 

Pacterial species: Bnterococcus f aecalis 

1236 5'-Q£G &GG CGT GGT GAA GTT CGC GTT GGT 883 370-391 
GGC T£Q C 

Bacterial speqjep : Enterococcus faeciun 

1235 5'-GCG AGC CGA AGT TGA AGT TGT TGG TAT 64 412-437 
TGC TGG CTC GC 

Bacterial sp ecies : Staphylococcus sp. other than S. aureus 

1233 5'-GCG AGC GTT ACT GGT GTA GAA ATG TTC 878 372-394 
CGG CTC GC 

a Underlined nucleotides indicate the molecular beacon's stein. 
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Annex XLVIII: Molecular beacon internal probes for hybridization 
and specific detection of ddl and mtl gen 
sequenc s. 

5 

Originating DNA fragment 



SEQ ID NO. Nucleotide sequence 3 SEQ ID Nucleotide 

NO. position 
10 



Bacterial species : E. faeclum (ddl) 

1237 5'-GCG AOC CGC GAA ATC GAA GTT GCT GTA 1242 334-359 
15 TTA GGS CTC ££ 

Bacterial species : E. faecalia (mtl) 

1238 5 ' - GCG AGC GGC GTT AAT TTT GGC ACC GAA 1243 631-656 
20 GAA GAS CTC GC 



Underlined nucleotides indicate the molecular beacon's stem. 
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«.IX. X£.™£ Action of » «*»°"" E - 
sequence. 

"Originating UNA UajiuuTT 

SB0 id ~ Nucleotide 
NO position 



Nucleotide sequence 



— Staphylococcus aureus 

sasterial-SBSsies 1 1244 

5 , _ AC T AAA TAA ACG CTC ATT CG 

1234 



35-54 
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10 



15 



What is claimed is: 

1. A repertory of nucleic acid sequences used for the detection and/or identification of a bacterial, fungal 
or parasitical species, genus, family or group, which repertory is created by amplifying the nucleic 
acids of a plurality of determined bacterial, fungal and parasitical species with any combination of the 
primer pairs shown in annexes I, II, XXI, XXII, XXXII, XXXIV, XXXVI, XL, XLHI and XLIV. 

2. A nucleic acid used for universal detection of any bacterium, fungus or parasite which is derived from 
the repertory of claim 1. 

3. A nucleic acid used for universal detection as set forth in claim 2, which has a nucleic acid sequence 
of at least 12 nucleotides capable of hybridizing with said any bacterium, fungus or parasite and with 
any one of SEQ ID NOs.: 543, 556-574, 636-655, 658-661, 664, 681-683, 694, 696-697, 699-700, 
708,812-815,911-917,919-922, 935-938, 1203-1207, 1212-1213, 1221-1229. 

4. A nucleic acid used for the specific and ubiquitous detection and for identification of a bacterial, 
fungal or parasital species, genus, family or group, which is derived from the repertory of claim 1. 

5. A nucleic acid as set forth in claim 4, which has a nucleic acid sequence of at least 12 nucleotides 
20 capable of hybridizing with the nucleic acids of said bacterial, fungal or parasitical species, genus, 

family or group and with any one of : 

SEQ ID NOs.: 

539, 540 for the detection and/or identification of Mycobacteriaceae family 

25 541, 542, 544 for the detection and/or identification of Pseudomonads group 

545, 546 for the detection and/or identification of Corynebacterium sp. 

547, 548, 1202 for the detection and/or identification of Streptococcus sp. 

549, 550, 582, 
583, 625, 626, 

30 627, 628, 1 199 for the detection and/or identification of Streptococcus agalactiae 

55 1, 552 for the detection and/or identification of Neisseria gonorrhoeae 

553, 575, 605, 
606, 707, 1175, 

1 176, 1233 for the detection and/or identification of Staphylococcus sp. 

35 554, 555 for the detection and/or identification of Chlamydia trachomatis 

576, 631,632, 
633, 634, 635, 

1 163, 1 164, 1 167 for the detection and/or identification of Candida sp. 

577, 1 156, 1 160 for the detection and/or identification of Candida albicans 
40 578, 1 166, 1 168 for the detection and/or identification of Candida dubliniensis 
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579 for the detection and/or identification of Escherichia coli 
580, 603, 1174. 

1236, 1238 for the detection and/or identification of Enterococcus faecalis 

581 for the detection and/or identification of Haemophilus influenzae 

5 584, 585, 586, 
587, 588, 1232, 

1234 for the detection and/or identification of Staphylococcus aureus 
589, 590, 591, 

592, 593 for the detection and/or identification of Staphylococcus epidennidis 

10 594, 595 for the detection and/or identification of Staphylococcus haemolyticus 

596, 597, 598 for the detection and/or identification of Staphylococcus hominis 
599, 600,601, 

695, 1208, 1209 for the detection and/or identification of Staphylococcus saprophyticus 

602, 1235, 1237 for the detection and/or identification of Enterococcus faecium 

1 5 604 for the detection and/or identification of Enterococcus gallinarum 

620, 1 122 for the detection and/or identification of Enterococcus casseliflavus, E. flavescens and E. 
gallinarum 

629, 630 for the detection and/or identification of Chlamydia pneumoniae 
656,657,271, 

20 1136,1137 for the detection and/or identification of Enterococcus sp. 

701, 702 for the detection and/or identification of Leishmania sp. 



- - 703,704,705, 

706, 793 for the detection and/or identification of Entamoeba sp. 

794, 795 for the detection and/or identification of Trypanosoma cruzi 

25 796, 797, 808, 

809, 8 1 0, 8 1 1 for the detection and/or identification of Clostridium sp. 
798, 799, 800, 
801,802, 803, 
804, 805, 806, 

30 807 for the detection and/or identification of Cryptosporidium parvum 

816,817,818, 

819 for the detection and/or identification of Giardia sp. 

820, 821, 822 for the detection and/or identification of Trypanosoma brucei 
823, 824 for the detection and/or identification of Trypanosoma sp. 

35 825, 826 for the detection and/or identification of Bordetella sp. 

923, 924, 925, 

926, 927, 928 for the detection and/or identification of Trypanosomatidae family 
933, 934 for the detection and/or identification of Enterobacteriaceae group 

935, 938 for the detection and/or identification of platelets contaminants group: Abiotrophia 

40 adiacens, Abiotrophia defectiva, Acinetohacter baumannii, Acinetobacter Iwoffi, 

Aerococcus viridans, Bacillus anthracis, Bacillus cereus, Bacillus subtilis, Brucella 
abortus, Burkholderia cepacia, Citrobacter diversus, Citrobacter freundii, Enterobacter 
aerogenes, Enterobacter agglomerans, Enterobacter cloacae, Enterococcus avium, 
Enterococcus casseliflavus, Enterococcus dispar, Enterococcus durans, Enterococcus 
45 faecalis, Enterococcus faecium, Enterococcus flavescens, Enterococcus gallinarum, 
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Enterococcus mundtii, Enterococcus raffinosus, Enterococcus solitarius, Escherichia 
coli, Gemella morbillorum, Haemophilus ducreyi, Haemophilus haemolyticus, 
Haemophilus influenzae, Haemophilus parahaemolyticus, Haemophilus parainfluenzae, 
Hafnia alvei, Kingella kingae, Klebsiella oxytoca, Klebsiella pneumoniae, Legionella 
5 pneumophila, Megamonas hypermegale, Moraxella atlantae, Moraxella catarrhalis, 

Morganella morganii, Neisseria gonorrheae, Neisseria meningitidis, Pasteurella 
aerogenes, Pasteurella multocida, Peptostreptococcus magnus, Proteus mirabilis, 
Providencia alcalifaciens, Providencia rettgeri, Providencia rustigianii, Providencia 
stuartii, Pseudomonas aeruginosa, Pseudomonas fluorescens, Pseudomonas stutzeri, 

10 Salmonella bongori, Salmonella choleraesuis, Salmonella enteritidis, Salmonella 

gallinarum, Salmonella typhimurium, Serratia liquefaciens, Serratia marcescens, 
Shigella flexneri, Shigella sonnei, Staphylococcus aureus, Staphylococcus capitis 
Staphylococcus epidermidis, Staphylococcus haemolyticus, Staphylococcus hominis, 
Staphylococcus lugdunensis, Staphylococcus saprophyticus, Staphylococcus simulans, 

15 Staphylococcus warneri, Stenotrophomonas maltophilia, Streptococcus acidominimus, 

Streptococcus agalactiae, Streptococcus anginosus, Streptococcus bovis, Streptococcus 
constellatus, Streptococcus cricetus, Streptococcus cristatus, Streptococcus dysgalactiae, 
Streptococcus equi, Streptococcus ferus, Streptococcus gordonii, Streptococcus 
intermedius, Streptococcus macacac, Streptococcus mitis, Streptococcus mutans, 

28 Streptococcus oralis, Streptococcus parasanguinis, Streptococcus parauberis, 

Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus ratti, Streptococcus 
salivarius, Streptococcus sanguinis, Streptococcus sobrinus, Streptococcus uberis, 
Streptococcus vestibularis, Vibrio cholerae, Yersinia enterocolitica, Yersinia pestis, 
Yersinia pseudotuberculosis. 

25 994, 995, 996, 
997, 998, 999, 
1000, 1001, 1200, 

1210, 121 1 for the detection and/or identification of Streptococcus pyogenes 

1157 for the detection and/or identification of Candida parapsilosis 

30 1 158, 1 159 for the detection and/or identification of Candida glabrata 

1 160 for the detection and/or identification of Candida tropicalis 

1 161 for the detection and/or identification of Candida krusei 

1 162 for the detection and/or identification of Candida guilliermondii 
1 162 for the detection and/or identification of Candida lusitaniae 

35 1 165 for the detection and/or identification of Candida zeylanoides 

1201 for the detection and/or identification of Streptococcus pneumoniae, 

6. A nucleic acid as set forth in claim 4, which has a nucleic acid sequence of at least 12 nucleotides 
capable of hybridizing with the nucleic acids of said bacterial, fungal or parasitical species, genus, 

40 family, group or universal, and using the former in conjunction with any nucleic acid sequence of at 

least 12 nucleotides capable of hybridizing with a bacterial antibiotic resistance gene and/or toxin 
gene respectively listed in Tables 5 and 6, 

7. Nucleic acids as set forth in claim 6 where the nucleic acid capable of hybridizing with a bacterial 
45 antibiotic resistance gene and/or toxin gene is any one of : 
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SEQ IDNOs .: 

1078, 1079, 1085 for the detection and/or identification of the Shiga-Iike toxin 2 (stx 2 ) gene 

1080, 108 1 , 1084 for the detection and/or identification of the Shiga-like toxin 1 (stx } ) gene 

1082, 1083 for the detection and/or identification of Shiga-like toxins 1 (stx,) and 2 (stx 2 ) genes 

1086, 1087, 1088, 

1089, 1090, 1091, 

1092, 1170, 1239, 

i24 ° for the detection and/or identification of the vanA resistance gene 

1 171, 1241 for the detection and/or identification of the vanB resistance gene 

1111, 1112, 1113, 
1114, 1115, 1116, 
1118, 1119, 1120, 

1 12 1 , 1 1 23, 1 1 24 for the detection and/or identification of the vanAB resistance genes 
1103,1104,1109, 

1110 for the detection and/or identification of the vanCl resistance gene 

1105, 1106, 1107, 

1 108 for ^e detection and/or identification of the vanC2 and vanC3 resistance genes 

1097, 1098, 1099, 

1 100, 1 101, 1 102 for the detection and/or identification of the vanCl, vanC2 and vanC3 resistance genes 
1150, 1153, 1154, 

1 1 55 for the detection and/or identification of the vanAXY resistance genes 

1094, 1125, 1126, 

1127, 1128, 1129, 

1130, 1131, 1132, 

1133, 1134, 1135, 

1192. 1193, 1194, 

1195, 1196, 1197, 

1214, 1216, 1217, 

1218, 1219, 1220 for the detection and/or identification of the S. pneumoniae pbpla gene 
1142,1143,1144 

1 145 for the detection and/or identification of the S. pneumoniae pbp2b gene 

1146,1147,1148 

1 1 49 for the detection and/or identification of the S. pneumoniae pbplx gene 

1 177 for the detection and/or identification of the mecA resistance gene. 



8. 



A method for the specific detection or identification of a bacterial, fungal or parasitical species, genus, 
family or group in a sample, using a panel of probes or amplification primers or both, each individual 
probe or primer being derived from a nucleic acid as defined in claim 4, the method comprising the 
step of contacting the nucleic acids of the sample with said primers or probes under suitable conditions 
of hybridization or of amplification and detecting the presence of hybridized probes or amplified 
products as an indication of the presence of said specific bacterial, fungal or parasitical species, genus, 
family or group. 

A method as set forth in claim 8, which further comprises probes or primers, or both, for the detection 
of at least one of the antibiotic resistance genes listed in Table 5. 
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10. A method as set forth in claim 8 or 9, which further comprises probes or primers, or both, for the 
detection of at least one of the toxin genes listed in Table 6. 

5 11. A method as defined in claim 8, which further comprises the universal detection of any bacterium, 
fungus or parasite. 

12. A nucleic acid having the nucleotide sequence of any one of the tuf sequences listed in Table 7 or in 
Table 11, or a part of thereof and variants thereof, which can be used as target nucleic acids for the 

10 detection and identification of microbial species. 

13. A nucleic acid having the nucleotide sequence of any one of the atpD sequences listed in Table 7 or in 
Table 11, or a part of thereof and variants thereof, which can be used as target nucleic acids for the 
detection and identification of microbial species. 

15 

14. A nucleic acid having the nucleotide sequence of any one of the recA sequences listed in Table 7 or in 
Table 1 1, or a part of thereof and variants thereof, which can be used as target nucleic acids for the 
detection and identification of microbial species. 

20 15. A nucleic acid having the nucleotide sequence of any one of the antibiotic resistance genes sequences 
determined by us which are listed in Table 7, or a part of thereof and variants thereof, which can be 
used as target nucleic acids for the detection and identification of microbial species. 

16. A nucleic acid having the nucleotide sequence of any one of the antibiotic resistance genes sequences 
25 listed in Table 5, or a part of thereof and variants thereof, which can be used as target nucleic acids for 

the detection and identification of microbial species. 

17. A nucleic acid having the nucleotide sequence of any one of the toxin genes sequences listed in Table 
6, a part of thereof and variants thereof, which can be used as target nucleic acids for the detection and 

30 identification of toxins in microbial species. 

18. A repertory of hexA nucleic acid sequences used for the detection and/or identification of 
Streptococcus pneumoniae, which repertory is created by amplifying the nucleic acid of any 
streptococcal species with any combination of primers SEQ ID NOs. 1 1 79, 1 1 8 1 and 1 1 82. 



35 



19. A nucleic acid used for the specific and ubiquitous detection and for identification of Streptococcus 
pneumoniae, which is derived from the repertory of claim 18. 
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20. A nucleic acid used for detection as set forth in claim 19 which has a nucleic acid sequence of at least 
12 nucleotides capable of hybridizing with said any Streptococcus pneumoniae and with any one of 
SEQIDNOs. ll84to 1187. 

5 21. A nucleic acid as set forth in claim 19, which has a nucleic acid sequence of at least 12 nucleotides 
capable of hybridizing with the nucleic acids of Streptococcus pneumoniae and with any one of SEQ 
IDNOs. 1179, 1180, 1181, 1182. 

22. A peptide sequence derived from the translation of the nucleic acid sequences from the repertory of 
10 claim 1. 

23. The use of the peptide sequences of claim 22 as part of a vaccine. 

24. The use of the peptide sequences of claim 22 as targets of antimicrobial agents. 



15 



25. 



The use of the peptide sequences of claim 22 to design or improve antimicrobial agents. 



26. 



The use of tuf sequences to identify any of the microorganisms listed in table 15. 



20 



27. 



The use of atpD sequences to identify any of the microorganisms listed in table 15. 



28. 



The use of recA sequences to identify any of the microorganisms listed in table 15. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 
{ i ) APPLICANTS * 

5 BERGERON, Michel G. \ 1145 des Erables, Quebec City, 

Quebec, Canada, G2K 1T8 
BOISSINOT, Maurice \ 109 Jean-Bruch6si , St-Augustxn- 

de-Desmaures , Quebec, Canada, G3A 2N2 
HULETSKY, Ann \ 12 31 Av des Pins, Sillery, Quebec, 
10 Canada, G1S 4J3 

MENARD, Christian \ 1174 Rue du Pont, St-Lambert-de- 

L^vis, Quebec, Canada, GOS 2W0 
OUELLETTE, Marc \ 1035 de Ploermel, Sillery, Quebec, 
Canada, G1S 3S1 , , x 

15 PICARD, Frangois J. \ 1245 de la Sapimere, Cap-Rouge, 

Quebec, Canada, GlY 1A1 
ROY, Paul H. 2 , 28 Charles Gamier, Loretteville, 
Quebec, Canada, G2A 2x8 

2 q 1 : C anadi an c i ti z en sh i p 

2 :American citizenship 

Mi) TITLE OF THE INVENTION: HIGHLY CONSERVED GENES AN THEIR USE 
TO GENERATE SPECIES-SPECIFIC, GENUS -SPECIFIC AND UNIVERSAL 
nc NUCLEIC ACID PROBES AND AMPLIFICATION PRIMERS TO RAPIDLY 

DETECT AND IDENTIFY BACTERIAL, FUNGAL AND PARASITICAL 
PATHOGENS FROM CLINICAL SPECIMENS FOR DIAGNOSIS 



30 



(iii) NUMBER OF SEQUENCES: 940 
(iv) CORRESPONDENCE ADDRESS: 



(A) ADDRESSEE: 

(B) STREET: 
35 (C) CITY: 

(D) STATE: 

(E) COUNTRY: 

(F) ZIP : 

40 (v) COMPUTER READABLE: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING: 
45 (D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION: 
50 (B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

55 (A) APPLICATION: 

(B) FILING DATE: 
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(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: 

(B) REGISTRATION NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 

(B) TELEFAX: 



-a 
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2 ) INFORMATION FOR SEQ ID NO : 1 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 750 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannn 

(B) STRAIN: ATCC 19606 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



CAAACTCGTG AGCACATCCT TCTTTCTCGT CAGGTAGGTG TACCTTACAT 50 

CATCGTATTC TTAAAC AAAT GCGACCTTGT TGATGACGAA GAATTACTTG 100 

2 0 AATTAGTAGA AATGGAAGTA CGTGAACTTC TTTCTACTTA TGACTTCCCA 15 0 

GGTGATGACA CTCCAGTAAT CCGTGGTTCA GCTCTTGCAG CGCTTAACGG 2 00 

TGAAGCTGGT CCTTACGGTG AAGAATCAGT TCTTGCTCTT GTAGCAGCAC 2 50 

TTGACTCTTA CATCCCAGAG CCAGAGCGTG CAATCGACAA AGCATTCTTG 300 

ATGCCAATCG AAGACGTATT CTCAATTTCT GGTCGTGGTA CAGTAGTAAC 3 50 

2 5 AGGCCGTGTT GAAGCTGGTA TCATCAAAGT TGGTGAAGAA GTAGAGATCG 400 

TTGGTATTAA AGATACAGTT AAAACAACTG TAACTGGCGT AGAAATGTTC 450 

CGTAAACTTC TTGAC GAAGG CCGTGCAGGT GAGAACTGTG GTATCTTACT 500 

TCGTGGTACT AAGCGTGAAG AAGTACAACG TGGTCAAGTA CTTGCTAAAC 550 

CAGGTACAAT CAAGCCGCAC ACTAAATTCG ACGCAGAAGT ATACGTACTT 600 

3 0 TCTAAAGAAG AAGGTGGTCG TCACACTCCA TTCTTAAATG GTTACCGTCC 650 

ACAGTTCTAC TTCCGTACAA CTGACGTAAC TGGTGCRATC CAGTTGAAAG 7 00 

AAGGCGTTGA AATGGTAATG C C AGGTG AC A ACGTTGAAAT GTCAGTAGAA 750 



35 
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2 ) INFORMATION FOR SEQ ID NO : 2 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 6 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:- 2 

CGGTGCGATC CTCGTGGTCG CCGCGACCGA CGGCCCCATG GCCCAGACCC 50 

GCGAGCACGT CCTGCTCGCC CGTCAGGTCG GCGTTCCCAC CATCCTCATC 100 

GCCCTCAACA AGTCCGACAT GGTTGACGAC GAGGAAATGA TGGAACTGGT 150 

55 CGAGGAGGAG TGCCGCGACC TGCTGGAGTC CCAGGACTTC GATCGCGATG 200 

CCCCGATCGT CCAGGTTTCC GCTCTGAAGG CCCTCGAGGG CGACGCGGAG 25 0 

TGGGTTGCCA AGATCGAGGA GCTCATGGAG GCTGTGGATT CCTACATCCC 3 00 

CACCCCCGAG CGCGATATGG ACAAGCCCTT CCTCATGCCG ATCG AGGAC G 350 

TCTTCACGAT CACAGGTCGT GGCACGGTCG TCACGGGGCG TGTTGAGCGT 400 

60 GGCAAGCTGC CGATCAACTC CGAGGTCGAG ATCCTCGGTA TCCGTGATCC 450 
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25 



CCAGAAGACC ACGGTCACCG GCATCGAGAT GTTCCACAAG TCGATGGACG 500 

AGGCATGGGC CGGCGAGAAC TGTGGCCTGC TGCTGCGCGG TACCAAGCGC 550 

GATGAGGTTG AGCGCGGC C A GGTTGTGGCC ATTCCCGGCT CCATCACGCC 600 

TC AC AC C GAG TTCGAGGGCC AGGTTTACAT CCTCAAGAAG GAAGAGGGCG 650 

GCCGTCACAA CCCGTTCTTC TCGAACTACC GTCCGCAGTT CTACTTCCGT 7 00 

ACCACGGACG TGACCGGCGT CATCACCCTC CCCGAGGGCA CCGACATGGT 750 

CATGCCTGGC GACACCACCG AGATCTCCGT TCAGCTGATC CAGCCCATCG 800 

CCATGGAGCC CGGCTGGGCT TCGCCA 826 



2 ) INFORMATION FOR SEQ ID NO: 3 



( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aerococcus virldans 

(B) STRAIN: ATCC 11563 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



TGGTGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTTTTAGCT GGCCAAATCG GTGTTCCTGC ATTCGTAGTA 100 

3 0 TTCTTAAACA AAGTTG AC C A AGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA GTACAACTAC CCAGGTGACG 2 00 

ATCTACCTGT AATCGCTGGT TCTGCTTTAT TAGCATTACA AGGCGATGAA 2 50 

GCTCAAGAAG C T AAAATC AT GGAATTAATG GAAGCTGTAG ACTCTTACAT 3 00 

TCCAGAACCA GAACGTGACA ACGACAAACC ATTCATGATG CCAATTGAGG 350 

3 5 ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGTGAAG TTCGTACAGG TGACGAAGTT GACATCGTTG GTATTGCTGA 45 0 

ACAAATCGGT AAATCAGTTG TAAC TGGTGT TGAAATGTTC CGTAAAAACT 500 

TAG AC T AC GC TCAAGCTGGT GACAACATCG GTGCATTATT ACGTGGTGTT 550 

CAACGTGAAG ACATCCAACG TGGTCAAGTA TTGGCTGCTC CTGGTTCAAT 600 

40 CACTCCACAT ACTAAATTTA AAGCGCAAGT TTACGTTTTA TCTAAAGAAG 650 

AAGGTGGACG TCATACACCA TTCTTAACTA ACTACCGTCC ACAATTCTAC 7 00 

TTCCGTACTA CTGACATTAC TGGTGTTATC AC TTT AC C AG AAGACGTAGC 7 50 

TATGGTTATG CCTGGTGACA ACGTTGATAT GGACGTTGAA TTGATTCACC 800 

CAGTTGCGAT CGAAGATGGT AC TAAATTCT CTATC 83 5 

45 



2 ) INFORMATION FOR SEQ ID NO : 4 

50 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp. 
60 denltrlf leans 
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(B) 



STRAIN: ATCC 1517 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 



5 CCTGGTGGTG 
TCCTGCTGAG 
AAGGCCGACA 
AGTTCGCGAR 
TCGTGAAGGG 

10 GGCGAACAGG 
GACGCCTGAG 
TGTTCTCGAT 
GGCATCATCA 
GGTGAAGACG 

15 AAGGTCAAGC 
GAAGACGTCC 
GCACACGGAC 
GCCGTCACAC 
ACGACGGACG 

20 CCTGCCGGGC 
CCATGGAAGA 



TCGGCCGCTG 
CCGCCAGGTT 
TGGTTGACGA 
CTGCTGAGCA 
TTCGGCCAAG 
CCATCATGGC 
CGTGCCGTTG 
CTCGGGTCGC 
AGGTCGGCGA 
ACCTGCACGG 
CGGCGACAAC 
AGCGCGGCCA 
TTCACGTCCG 
TCCGTTCTTC 
TGACGGGCAC 
GACAACGTGG 
AGGCCTGCGT 



ACGGCCCGAT 
GGCGTGCCGT 
CGCCGAGCTG 
AGTACGACTT 
CTGGCGCTGG 
GCTGGCCGCT 
ACGGCGCGTT 
GGCACCGTGG 
GG AAATCGAA 
GCGTGGAAAT 
GTGGGCATCC 
GGTTCTGGCC 
AGGTGTACAT 
CAAGGCTATC 
GATCGAGCTG 
CCATGACGGT 
TCGCCAC 



GCCGCAAACG 
ACATCATCGT 
CTTGAGCTGG 
CCCGGGCGAC 
AAGGCGACAA 
GCGCTGGACT 
CCTGATGCCG 
TGACCGGCCG 
ATCGTCGGTC 
GTTCCGCAAG 
TRCTGCGCGG 
AAGCCGGGCT 
CCTGTCCAAG 
GTCCCCAGTT 
CCGGCCGACA 
CAAGCTGCTG 



CGCGAACACA 50 

CTTCCTGAAC 100 

TGGAAATGGA 150 

GACACCCCGA 2 00 

GGGCGAACTG 2 50 

CGTACATCCC 3 00 

GTTGAAGACG 3 50 

TATCGAACGC 4 00 

TGGTGCCGAC 4 50 

CTGCTGGACC 5 00 

CACCAAGCGT 5 50 

CGATCACCCC 600 

GAAGAAGGCG 6 50 

CTACTTCCGC 7 00 

AGGAAATGGT 7 50 

GCTCCGATCG 8 00 
827 



2 5 2 ) INFORMATION FOR SEQ ID NO : 5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

_ . (D)-^— TOP-OLOGY.:. .Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Anaerorhabdus furcosus 

(B) STRAIN: ATCC 25662 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



TGGATCAATC CTAGTAGTTG CTGCAACTGA TGGACCAATG CCTCAAACTC 50 

GTGAAC ATAT CTTACTTGCT CGTCAAGTAG GTGTTCCAAG AATGGTTGTA 100 

TTCTTGAACA AATGCGACAT GGTTGAAGAT GAAGAATTAA TCGACCTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTAAGTGC TTACGGTTTC GAAGGTGATG 2 00 

45 ATACACCAGT TATCCGTGGT TCTGCATTAA AATCTCTTGA AGGAAATGCT 250 

GATTGGGAAG CAAAAGTTGC TGAATTAATG GATGCAGTTG ACTC TTGG AT 3 00 

TCCAACTCCA ACTCATGAAA CAGACAAACC ATTCTTAATG GCTGTTGAAG 350 

ATGTATTCAC AATTACAGGT CGTGGTACAG TTGCTACTGG ACGTGTTGAA 400 

CGTGGACACT TAAACCTTAA CGAAGAAGTT GAAATCGTTG GTATTCATGA 4 50 

50 T AC T AAG AAA TCAGTTGTTA CTGGTATCGA AATGTTCCGT AAATTATTAG 5 00 

ACTATGCTGA AGCAGGAGAC AACATTGGTG CATTATTACG TGGTGTTTCT 550 

CGTGATGAAA TCGAACGTGG AC AATGTC T A GCTAAACCTG GATCAGTTAC 600 

TCCACATACA GCTTTCAAAG CTCAAGTATA CGTATTAACT AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC GT AAC TAACT ACCGTCCTCA ATTC TATTTC 7 00 

55 CGTACAACTG ACGTAACAGG AGTTGTTAAA CTTCCTGAAG GTACTGAAAT 750 

GGTTATGCCT GGAGACAACA TCGAAATGAT CGTTGAATTA ATCGCTCCAA 800 

TCGCTGTTGA ACAAGGAACT AAG 823 



60 
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10 



2 ) INFORMATION FOR SEQ ID NO : 6 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracls 

(B) STRAIN: 4229 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACTC 5 0 

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTACCTTA CATCGTTGTA 100 

TTCTTAAACA AATGCGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 15 0 

20 AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 2 00 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 2 50 

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 3 00 

CCCAACTCCA G AAC GTG AAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 3 50 

ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400 

25 CGCGGTATCG TTAAAGTTGG TGACGTAGTA GAAATCATCG GTCTTGCTGA 450 

AG AAAATGC T TCTACAACTG TAACTGGTGT AGAGATGTTC CGTAAACTTC 500 

TTGACCAAGC TCAAGCTGGA GACAACATCG GTGCTTTACT TCGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA CTTGCAAAAA GCGGTTCTGT 600 

AAAAGCTCAC GC TAAATTC A AAGCTGAAGT TTTCGTATTA TCTAAAGAAG 650 

3 0 AAGGTGGACG TCACACTCCA TTCTTCGCTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTATCATC CAATTACCAG AAGGTACTGA 750 

AATGGTAATG C C TGGTG AC A ACATCGAAAT GACTATCGAA CTTATCGCTC 800 

CAATCGCTAT CGAAGAGGGA ACTAA 825 



35 



50 



2 ) INFORMATION FOR SEQ ID NO : 7 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCTCAAACAC 5 0 

GTGAGCACAT CCTTCTTTCT CGTCAAGTAG GTGTTCCTTA CATCGTTGTA 100 

55 TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGCGACC TATTATCTGA ATACGGATTC CCAGGCGACG 200 

ACATTCCTGT AATCAAAGGT TCTGCTCTTA AAGCTCTTCA AGGAGAAGCT 25 0 

GATTGGGAAG CAAAAATCAT TGAATTAATG GCTGAAGTTG ATGCTTACAT 300 

CCCAACTCCA GAACGTGAAA CTGACAAACC ATTCTTAATG CCTGTAGAGG 35 0 

60 ACGTATTCTC TATCACAGGT CGTGGTACAG TTGCTACTGG TCGTGTTGAG 400 
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CGCGGTATCG TTAAAGTTGG 
AGAAAATGCT TCTACAACTG 
TTGACCAAGC TCAAGCTGGA 
GCTCGTGAAG ACATCCAACG 
5 AAAAGCTCAC GCTAAATTCA 
AAGGTGGACG TCACACTCCA 
TTCCGTACAA CTGACGTAAC 
AATGGTAATG CCTGGTGACA 
CAATCGCTAT CGAAGAGGGA 



TGACGTAGTA GAAATCATCG 
TAACTGGTGT AGAGATGTTC 
GACAACATCG GTGCTTTACT 
TGGACAAGTA CTTGCAAAAA 
AAGCTGAAGT TTTCGTATTA 
TTCTTCGCTA ACTACCGTCC 
TGGTATCATC CAATTACCAG 
ACATTGAAAT GACTATCGAA 
ACTAAATTC 



GTCTTGCTGA 450 

CGTAAACTTC 5 00 

TCGTGGGGTT 5 5 0 

GCGGTTCTGT 600 

TCTAAAGAAG 650 

TCAGTTCTAC 7 00 

AAGGTACTGA 7 5 0 

CTTATCGCTC 8 00 
829 



2) INFORMATION FOR SEQ ID NO: 8 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 
25 (B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

CGGTGCTATC ATCGTAGTTG CTGCTACTGA TGGTCCTATG CCTCAAACTC 50 

3 0 GCGAGCACAT CCTTTTGGCT CGTCAGGTAA ACGTTCCGAG ATTGGTTGTA 100 

-__ T _T-GA-TGAACA AGTGTG AC AT -GGTTGACGAC . G AGG AAATGT TGGAATTGGT 150 

TGAGATGGAG ATGAGAGAGT TGCTTTCATT CTATCAATTC GACGGTGACA 2 00 

ACACTCCGAT CATCCGTGGT TCTGCTCTTG GTGCATTGAA CGGTGATGCT 250 

CAATGGGAAG ATAAAGTAAT GGAGTTGATG GAAGCTTGTG ATACTTGGAT 300 

3 5 TCCTCTGCCT CCGCGCGAAA TCGACAAGCC GTTCTTGATG CCGGTTGAGG 35 0 

ACGTATTCTC AATCACGGGT CGTGGTACTG TTGCTACAGG TCGTATCGAG 400 

ACAGGTATTG TTAAGGTTGG TGAGGAAGTT CAGATCATCG GTCTTGGCGC 450 

TGCTGGTAAG AAATCTGTTG TTACAGGTGT TGAGATGTTC CGTAAGTTAT 500 

TGGATCAAGG TGAGGCTGGT GATAACGTTG GTTTGTTGCT TCGCGGTATC 55 0 

40 GATAAGAATG AGATCAAGCG TGGTATGGTA ATCTGCCACC CGGGTCAGGT 600 

TAAAGAGCAT TC T AAGTTC A AGGCTGAGGT TTATATCTTG AAGAAAGAGG 650 

AAGGTGGTCG TCACACTCCG TTCCACAACA AATATCGTCC TCAGTTCTAT 7 00 

ATCCGTACAT TGGATGTAAC TGGTGAGATC ACTTTGCCGG AAGGAACTGA 750 

AATGGTAATG CCGGGTGATA ACGTAACGAT CGAGGTTGAG TTGATCTATC 800 

45 CGGTAGCATG TAGCGTAG 818 



50 



2 ) INFORMATION FOR SEQ ID NO : 9 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus casseli flaws 

210 



CA 02307010 2000-05-19 



(B) STRAIN: R763 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

5 GGTCCTATGC CTCAAACACG TGAACACATC TTGTTATCAC GTAACGTTGG 50 

TGTACCATAC ATCGTTGTTT TCTTAAACAA AATGGATATG GTTGATGACG 100 

AAGAATTACT AGAATTAGTT GAAATGGAAG TTCGTGACTT ATTGTCAGAA 150 

TATGACTTCC CAGGCGACGA TGTTCC TGTA ATCGCTGGTT CTGCTTTGAA 20 0 

AGCTCTTGAA GGCGATGCTT CATACGAAGA AAAAATCATG GAATTAATGG 250 

10 CTGCAGTTGA CGAATACGTT CCAACTCCAG AACGTGACAC TGACAAACCA 300 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATC AC TGG AC GTGGTACTGT 3 50 

TGCTACAGGC CGTGTTGAAC GTGGACAAGT TCGCGTTGGT G AC G AAGTTG 400 

AAATCGTTGG TATTGCTGAA GAAACTGCTA AAACAACTGT AAC TGGTGTT 450 

G AAATGTTC C GTAAATTGTT AG AC T ATGC T GAAGCAGGGG AT AAC ATTGG 500 

15 TGCATTGCTA CGTGGTGTTG CTCGTGAAGA CATCCAACGT GGACAAGTAT 550 

TGGCTAAAGC TGGTACAATC ACACCTCATA CAAAATTTAA AGCTGAAGTT 600 

TACGTTTTAA CAAAAGAAGA AGGTGGACGT CACACACCA 639 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 10 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 692 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
( vi ) ORIGINAL-SOURCE : 



"(A) ORGANISM: Staphylococcus saprophyticus 
(B) STRAIN: CSG 197 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

GAACACATTC TTTTATCACG TAAC GTTGGT GTTCCAGCAT TAGTTGTATT 50 

CTTAAACAAA GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG 100 

AAATGGAAGT TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT 150 

4 0 GTACCTGTAA TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA 2 00 

CTATGAGCAA AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC 250 

CAACACCAGA ACGTGATTCT G AC AAAC CAT TCATGATGCC AG TTG AGG AC 3 00 

GTATTCTCAA TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG 3 50 

TGGTCAAATC AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG 400 

45 AATCAAGCAA AACAACTGTT ACTGGTGTAG AAATGTTC CG TAAATTATTA 4 50 

GACTACGCTG AAGCTGGTGA C AAC ATTGGT GCATTATTAC GTGGTGTTTC 500 

ACGTGATGAC GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA 5 50 

CACCACATAC AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA 600 

GGTGGTCGTC ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 650 

50 CCGTACTACT GACGTAACTG GTGTTGTTAA CTTACCAGAA GG 692 



2 ) INFORMATION FOR SEQ ID NO : 11 

55 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroldes ovatus 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



10 CGGTGCTATC 
GCGAACACAT 
TTCTTGAACA 
TGAAATGGAA 
ATACTCCTAT 

15 AAATGGGAAG 
TCCACTGCCT 
ACGTGTTCTC 
ACAGGTGTCA 
AGATAAGAAA 

20 ATCAAGGTGA 
AAG AAC G AAA 
ACCGCACTCT 
GTGGTCGTCA 
CGTACTATGG 

2 5 GGTAATGCCG 
TAGCATTGAA 



ATCGTTTGTG 
TC TGTTAGC T 
AATGC GAT AT 
ATGAGAGAAC 
CATCCGTGGT 
AC AAAGTTAT 
CCGCGCGATG 
TATCACAGGT 
TCCACGTTGG 
TCAGTTGTAA 
AGCTGGTGAC 
TCAAACGTGG 
AAATTCAAAG 
CACTCCGTTC 
ACTGTACAGG 
GGTGATAACG 
CCCGGGCTTC 



CTGCAACTGA 
CGTCAGGTAA 
GGTAGACGAC 
TCCTTTCATT 
TCTGCTCTTG 
GGAACTGATG 
TTG ATAAAC C 
CGTGGTACTG 
TGATGAAGTC 
CTGGTGTTGA 
AACGTAGGTC 
TATGGTTCTT 
CTGAGGTTTA 
CACAACAAAT 
TGAAATCACT 
TAACTATTAC 
G 



TGGTCCGATG 
ACGTACCTCG 
GAAGAAATGT 
CTATGATTTC 
GCGCATTGAA 
GATGCAGTTG 
ATTCTTGATG 
TAGCAACAGG 
GAAATTCTTG 
AATGTTCCGT 
TTTTGCTTCG 
TGTAAACCAG 
TATCTTGAAG 
ACCGTCCTCA 
TTGCCGGAAG 
AGTTGAGTTG 



CCTCAAACTC 50 

TCTGGTTGTA 100 

TGGAACTCGT 150 

GATGGTGACA 2 00 

CGGTGTTGAA 250 

ATAACTGGAT 300 

CCGGTTGAAG 3 50 

TCGTATCGAA 400 

GTTTAGGTGA 450 

AAACTGTTGG 500 

TGGTATTGAC 550 

GTCAGATTAA 600 

AAAGAAGAAG 650 

GTTCTACTTG 7 00 

GAACAGAAAT 750 

ATTT AC C C AG 8 00 
821 



30 . 2 ) INFORMATION FOR SEQ _ID NO: 12 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 838 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

45 



TGGTGCGATT TTGGTTGTTT CAGCTGCTGA TGGTCCGATG CCTCAAACAC 50 

GTGAGCATAT TCTTCTTGCC CGTCAGGTTG GTGTTCCAGC GATTGTTGTT 100 

TTTCTTAATA AGGTTGATCA GGTTGATGAT GCTGAGCTTT TGGAGC TTGT 150 

TGAGCTTGAA GTTCGGGAGT TATTGTCGAA ATATGATTTT CCAGGAGACG 200 

50 ATATTCCGAT CGTTAAAGGT TCTGCTTTGG CAGCGCTTGA AGATAAAGAT 2 50 

AAAAGCATTG GTGAAGATGC GGTTCGTCTT TTGATGAGTG AAGTTGATAA 300 

TTAT AT AC CG ACGCCTGAAC GTCCTGTTGA TCAGCCGTTT TTGATGCCAA 3 50 

TTGAAGATGT TTTTTCGATT TCGGGTCGTG GAACTGTTGT G AC GGGTCGT 4 00 

GTTGAGCGTG GTGTTATTAA GGTTGGTGAA GAAGTTGAGA TTATCGGCAT 450 

55 TCGTCCAACT TCTAAGACAA CAGTTACAGG GGTTGAAATG TTCCGCAAGC 500 

TTTTAGATCA GGGGCAAGCG GGTGATAATA TTGGAGCGCT GCTTCGTGGT 550 

ATTGATCGTG AAGGGATTGA GCGTGGACAA GTTTTGGCGA AGCCTGCTTC 600 

GGTTACACCT CATACGAGAT TTAAAGCAGA GGCTTACATT TTG AC GAAAG 650 

ATGAAGGTGG TCGTCATACT CCATTTTTCA CGAATTATCG TC CTC AGTTT 7 00 

60 TATTTCCGTA CTACGGATGT AACGGGAATT GTTACGCTTC CAGAAGGTAC 7 50 
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AGAGATGGTT ATGCCTGGTG ATAATGTTGC TATGGATGTC TCTCTGATTG 800 
TTCCAATTGC CATGGAAGAA AAACTTCGTT TTGCTATC 83 8 



2 ) INFORMATION FOR SEQ ID NO : 13 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 839 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentls 

(B) STRAIN : ATCC 15703 



2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TGGCGCCATC CTTGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTCCCGAA GATCCTCGTC 100 

GCTCTGAACA AGTGCGATAT GGTCGACGAC GACGAGCTCA TCGAGCTCGT 150 

25 TGAGGAAGAG GTCCGTGACC TCCTCGACGA AAATGGCTTC GATCGCGATT 2 00 

GCCCGGTCAT CCACGTGTCC GCTTACGGCG CACTGCACGA TGACGCTCCG 2 50 

GACCACGAGA AGTGGGTTGA GCAGATCAAG AAGCTCATGG ACGCCGTCGA 3 00 

TGACTACATC CCGACCCCGG TCCACGATCT GGACAAGCCG TTCCTGATGC 350 

CGATCGAAGA TGTCTTCACC ATCTCCGGCC GTGGCACCGT GGTGACCGGC 400 

30 CGTGTCGAGC GTGGTAAGCT CCCGGTCAAC TCCAACGTCG AGATCGTCGG 450 

CATCCGTCCG ACCCAGACCA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AG C AG ATGGA C G AGTGCG AG GCTGGCGACA ACACCGGTCT GCTGCTCCGC 550 

GGCATCAACC GTGACCAGGT CGAGCGTGGC CAGGTTCTGG CTGCTCCGGG 600 

CTCCGTGACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 65 0 

3 5 AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GTACCACCGA CGTCACCGGC GTCATCACCC TGCCGGAAGG 750 

CGTTGAGATG GTGCAGCCGG GCGATCACGC TACCTTCGGC GTTGAGCTGA 800 

TCCAGCCGAT CGCTATGGAA GAGGGCCTGA CCTTCGCAG 83 9 



40 



2 ) INFORMATION FOR SEQ ID NO: 14 



( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium dentium 

(B) STRAIN: ATCC 27534 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



55 



TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCTCAGACCC 50 

GCGAGCACGT GCTGCTCGCT CGTCAGGTGG GCGTGCCGCG TATCCTCGTC 100 

6 0 GCCCTGAACA AGTGCGATAT GGTCGACGAC GAAGAGCTCA TCGAGCTCGT 150 
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TGAGGAAGAG 
GCCCGGTCAT 
GACCACGACA 
CGAGTACATC 
5 CGATCGAAGA 
CGTGTCGAGC 
CATCCGTCCG 
AGCAGATGGA 
GGCATCAACC 
10 CTCCGTGACC 
AGGACGAAGG 
TTCTACTTCC 
CGTTGAGATG 
TCCAGCCGAT 

15 



GTCCGTGACC 
CCACACCTCC 
AGTGGGTTGA 
CCGACCCCGA 
TGTGTTCACC 
GTGGTAAGCT 
ACCCAGACCA 
CGAGTGCGAG 
GTGACCAGGT 
CCGCACACCA 
CGGCCGTCAC 
GTACCACCGA 
GTGCAGCCGG 
CGCTATGGAA 



TCCTCGACGA 
GCCTACGGCG 
GTCCGTCAAG 
CCCACGATCT 
ATCTCCGGCC 
CCCGGTCAAC 
CCACCGTCAC 
GCTGGCGACA 
CGAGCGTGGC 
AGTTCGAGGG 
TCGCCGTTCT 
CGTCACCGGC 
GCGATCACGC 
GAGGGCCTGA 



AAACGGCTTC 
CGCTGCACGA 
GAACTCATGA 
GGACAAGCCG 
GTGGC AC CGT 
TCCAACGTTG 
CTCCATCGAG 
AC AC CGGTCT 
CAGGTTCTGG 
CGAAGTCTAC 
TCTCCAACTA 
GTCATCACCC 
TACCTTCGGC 
CCTTCGCAG 



GATCGCGATT 2 00 

TGACGCTCCG 2 50 

AGGCCGTCGA 3 00 

TTCC TGATGC 3 50 

GGTTACCGGC 400 

AGATCGTCGG 450 

ACCTTCCACA 500 

GCTGCTCCGC 550 

CTGCTCCGGG 60 0 

GTGCTGACCA 65 0 

CCGTCCGCAG 7 00 

TGCCGGAAGG 7 5 0 

GTTGAGCTGA 800 
839 



2 ) INFORMATION FOR SEQ ID NO : 15 

2 0 ( i ) SEQUENCE CHARACTERISTICS: 

(A) I/ENGTH: 83'5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 

3 0 (B) STRAIN: S 2308 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

TGGCGCGATC CTGGTGGTTT CGGCTGCTGA CGGCCCGATG CCGCAGACCC 50 

3 5 GCGAGCACAT CCTGCTTGCC CGTCAGGTTG GCGTTCCGGC GATCGTCGTG 100 

TTCCTCAACA AGTGCGACCA GGTTG AC GAT GCAGAACTGC TCGAACTGGT 150 

TGAACTGGAA GTGCGCGAAC TTCTGTCGAA GTACGAATTC CCCGGCGACG 2 00 

AAATCCCGAT CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCC 2 50 

AAGGAACTGG GCGAAGATGC CATCCGCAAC CTGATGGACG CGGTTGACAG 3 00 

40 CTACATTCCG ACCCCGGAAC GCCCGATCGA CCAGCCGTTC CTGATGCCGA 35 0 

TCGAAGACGT GTTCTCGATC TCCGGCCGTG GTACGGTTGT GACGGGTCGC 4 00 

GTTGAGCGCG GTATCGTTAA GGTCGGTGAA GAAGTTGAAA TCGTCGGCAT 450 

CAAGGCGACG ACGAAGACCA CGGTTACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTCGACCA GGGCCAGGCT GGCGACAACA TTGGCGCGCT GATCCGCGGC 5 50 

45 GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

TGTGAAGCCG CACACCAAGT TTAAGGCAGA AGC CTATATT CTGACCAAGG 650 

ACGAAGGTGG CCGTCATACG CCGTTCTTCA CCAACTACCG TCCGCAGTTC 7 00 

TACTTCCGTA CGACGGACGT GACGGGTGTT GTGACGCTTC CGGCTGGCAC 750 

GGAAATGGTC ATGCCTGGCG ATAACGTCGC CATGGACGTT ACCCTGATCG 800 

50 TGCCGATCGC CATGGAAGAG AAGCTTCGCT TCGCTATC 83 8 



55 



2) INFORMATION FOR SEQ ID NO: 16 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 771 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Burkholderia cepacia 

(B) STRAIN: LSPQ 2217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

10 GGCAGCAGAC GGCCCGATGC CGCAAACGCG TGAGCACATC CTGCTGGCGC 50 

GTCAGGTTGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA GTGCGACATG 100 

GTGGACGACG CCGAACTGCT CGAGCTGGTC GAGATGGAAG TTCGCGAACT 150 

CCTGTCGAAG TACGACTTCC CGGGCGACGA CACGCCGATC GTGAAGGGTT 2 00 

CGGCGAAGCT GGCGCTGGAA GGCGACACGG GCGAGCTGGG CGAAGTGGCG 250 

15 ATCATGAGCC TGGCCGACGC GCTGGACACG TACATCCCGA CGCCGGAGCG 300 

TGCAGTTGAC GGCGCGTTCC TGATGCCGGT GGAAGACGTG TTCTCGATCT 350 

CGGGCCGCGG TACGGTGGTG ACGGGTCGTG TCGAGCGCGG CATCGTGAAG 4 00 

GTCGGCGAAG AAATCGAAAT CGTCGGTATC AAGCCGACGG TGAAGACGAC 450 

CTGCACGGGC GTTGAAATGT TCCGCAAGCT GCTGGACCAA GGTCAAGCAG 500 

2 0 GCGACAACGT TGGTATCCTG CTGCGCGGCA CGAAGCGTGA AGACGTGGAG 550 

CGTGGCCAGG TTCTGGCGAA GCCGGGTTCG ATCACGCCGC ACACGCACTT 600 

CACGGCTGAA GTGTACGTGC TGAGCAAGGA CGAAGGCGGC CGTCACACGC 650 

CGTTCTTCAA CAACTACCGT CCGCAGTTCT ACTTCCGTAC GACGGACGTG 7 00 

ACGGGCTCGA TCGAGCTGCC GAAGGACAAG GAAATGGTGA TGCCGGGCGA 7 50 

25 CAACGTGTCG ATCACGGTGA A 77 1 



2) INFORMATION FOR SEQ ID NO: 17 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Cedecea davlsae 

(B) STRAIN: ATCC 33431 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

45 GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 5 0 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCCCAG TACGACTTCC CGGGCGACGA 200 

TACTCCAATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

50 AGTGGGAAGC TAAAATCGTT GAGCTGGCTG GCTACCTGGA TTCTTACATC 300 

CCTGAGCCAG AGCGTGCTAT' CGATAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

55 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCAGG CTCTATCAAG 600 

CCACACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGC ACCATCGAAC TGCCAGAAGG CGTTGAGATG 7 50 

60 GTAATGCCTG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 
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CGCGATGGAT GACGGTCTGC GTTTCGCAA 



829 



5 2 ) INFORMATION FOR SEQ ID NO : 18 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 3 3 855 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

^ CGCTATCCTG GTTGTTGCTG CG AC TG AC GG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGC TGGTTG A 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGATGACA 2 00 

2 5 CTCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG C GAAGC AG AG 25 0 

TGGGAAGCTA AAATYGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 3 00 

AGAACCAGTA CGTGCAATCG AYCTGCCGTT CCTGCTGCCA ATCGAAGACG 3 50 

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 450 

30 TGCGAAATCT AC CTGTAC CG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGC TACC GTCCACAGTT CTACTTCCGT 7 00 

35 ACAACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 7 50 

AATGCCAGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCG 824 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 19 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 827 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

CGCTATTCTG GTTGTTGCTG CAACTGACGG CCCTATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGC CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAAT GTGACATGGT TGATGACGAA GAGCTGCTGG AGCTGGTAGA 15 0 

60 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCA GGCGATGATA 2 00 
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CCCCAATCAT CCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAAGCAGAG 250 

TGGGAAGCTA AAATCGTTGA GCTGGCTGGC TTCCTGGATT CCTACATCCC 300 

AGAACCAGTA CGTGCAATCG ACCTGCCGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCCAT CTCCGGCCGT GGTACCGTTG TKACCGGTCG TGTAGAGCGC 400 

5 GGTATCGTTA AAGTGGGCGA AGAAGTAGAA ATCGTTGGTA TCAAAGATAC 450 

TGCGAAATCT ACCTGTAC TG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 5 00 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTTCTGGCT AAGCCAGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT CCTGTCCAAA GACGAAGGCG 650 

10 GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCACAGTT CTACTTCCGT 7 00 

ACCACTGACG TGACCGGTAC CATCGAACTG CCAGAAGGCG TAGAGATGGT 75 0 

AATGCCAGGT GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGG AC G A CGGTCTGCGT TTCGCAA 82 7 



15 



30 



60 



2 ) INFORMATION FOR SEQ ID NO: 2 0 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 { i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 029 



(xi) SEQUENCE -DESCRIPTION: -SEQ ID NO: 20 



GCGGAGCTAT CCTAGTCGTT TCAGCTACAG ACGGAGCTAT GCCACAAACT 50 

AAAGAACATA TCTTGCTAGC TCGCCAGGTT GGAGTTCCTT ATATCGTTGT 100 

35 TTTCTTGAAT AAAGTAGATA TGATCTCTCA AGAAGATGCT GAACTTATTG 150 

ACCTTGTTGA GATGGAACTT AGTGAGCTTC TTGAAGAAAA AGGCTACAAA 200 

GGATGCCCTA TTATCCGTGG TTCTGCTTTG AAAGCTCTTG AAGGTGATGC 250 

AAATTATATC GAAAAAGTTC GAGAACTTAT GCAAGCTGTG GATGACAACA 3 00 

TCCCTACACC AGAAAGAGAA ATTGATAAGC CTTTCTTAAT GCCTATCGAA 350 

40 GACGTATTCT CAATCTCTGG TCGTGGTACT GTGGTTACAG GAAGAATCGA 400 

GCGTGGAATC GTTAAAGTTT CTGATAAAGT TCAGCTCGTG GGATTAGGAG 450 

AGACTAAAGA AACAATCGTT ACTGGAGTCG AAATGTTCAG GAAAGAACTT 500 

CCTGAAGGTC GTGCAGGAGA AAAC GTTGGT TTACTCCTCA GAGGTATTGG 55 0 

AAAGAACGAT GTTGAAAGAG GTATGGTGGT TTGTCAGCCT AACAGCGTGA 600 

45 AGCC TC AT AC GAAATTTAAG TCAGCTGTTT ACGTTCTTCA GAAAGAAGAA 650 

GGCGGACGTC ATAAGCCTTT CTTCAGCGGA T AC AG AC C TC AGTTCTTCTT 7 00 

CCGTACTACA GACGTGACAG GAGTCGTAAC TCTTCCTGAA GGAACTGAAA 750 

TGGTAATGCC TGGAGATAAC GTTGAGCTTG ATGTTGAGCT CATTGGAACA 800 

GTTGCTCTTG AAGAAGGAAT GAGATTTGCA A 831 

50 

2 ) INFORMATION FOR SEQ ID NO : 21 

55 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pslttaci 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

TGGAGCGATT CTCGTTGTTT CCGCTACTGA CGGTGCGATG CCTCAGACCA 50 

AAGAACATAT TCTTTTGGCG AGACAGGTTG GTGTTCCTTA CATCGTTGTT 100 

10 TTCCTTAACA AAATC GAT AT GATTTCTCAA GAAGATGCTG AGCTCGTAGA 150 

CTTAGTTGAA ATGGAATTGT CCGAACTTCT AGAAGAAAAA GGTTATAAAG 200 

GTTGCCCAAT TATCCGTGGT TCTGCTTTGA AAGC CTT AG A AGGTGATGCA 250 

AGCTACGTTG AAAAAATTCG CGAGTTAATG CAAGCAGTGG ATGATAACAT 300 

CCCTACTCCA GAGCGTGAAG TTGATAAGCC TTTCTTAATG CCTATCGAAG 350 

15 ACGTATTCTC TATTTCTGGT CGTGGTACTG TGGTCACAGG ACGTATCGAG 400 

CGTGGAATCG TTAAAGTGGG TGATAAAGTA CAGATTGTTG GTTTAAGAGA 450 

TACTAGAGAG ACAATTGTTA CCGGTGTGGA AATGTTCAGA AAAGAACTTC 500 

CAGAAGGTCA AGCAGGGGAA AACGTTGGTT TGCTCCTCAG AGGTATCGGT 550 

AAGAATGACG TTGAAC GTGG TATGGTTATC TGCCAACCTA ATAGCGTGAA 600 

2 0 ATCTCACACA CAATTTAAAG GTGCTGTCTA CATTCTACAA AAAGAAGAGG 650 

GTGG ACGTG A TAAACCTTTC TTTACCSGAT ACAGACC^rCA GTTCTTCT'TC 700 

CGTACAACAG ATGTTACAGG TGTTGTAACT CTCCCAGAAG GTACAGAGAT 750 

GGTTATGCCA GGCGATAACG TTGAATTCGA AGTTCAATTA ATTAGCCCAG 800 

TAGCTCTAGA AGAAGGTATG AGATTT 82 6 

25 

2 ) INFORMATION FOR SEQ ID NO: 22 

30 (i) SEQUENCE CHARACTERISTICS: 
(-A) LENGTH-:- 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 
40 (B) STRAIN: LGV 12 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

GGGGC TATTC TAGTAGTTTC TGCAACAGAC GGAGCTATGC CTCAAACTAA 50 

45 AGAGCATATT CTTTTGGCAA GACAAGTTGG GGTTCCTTAC ATC GTTGTTT 100 

TTCTCAATAA AATTGACATG ATTTCCGAAG AAGACGCTGA ATTGG TCG AC 15 0 

TTGGTTGAGA TGGAGTTGGC TGAGCTTCTT GAAGAGAAAG GATACAAAGG 2 00 

GTGTCCAATC ATCAGAGGTT CTGCTCTGAA AGCTTTGGAA GGGGATGCTG 2 50 

CATACATAGA GAAAGTTCGA GAGCTAATGC AAGCCGTCGA TGATAATATC 3 00 

50 CCTACTCCAG AAAGAGAAAT TGACAAGCCT TTCTTAATGC CCATTGAGGA 3 50 

CGTGTTCTCT ATCTCCGGAC GAGGAACTGT AGTAACTGGA CGTATTGAGC 400 

GTGGAATTGT TAAAGTTTCC GATAAAGTTC AGTTGGTCGG TCTTAGAGAT 450 

ACTAAAGAAA CGATTGTTAC TGGGGTTGAA ATGTTCAGAA AAGAACTCCC 500 

AGAAGGTCGT GCAGGAGAGA ATGTTGGATT GCTCCTCAGA GGTATTGGTA 550 

55 AGAACGATGT GGAAAGAGGA ATGGTTGTTT GCTTGCCAAA CAGTGTTAAA 600 

CCTCATACAC GGTTTAAGTG TGCTGTTTAC GTTCTGCAAA AAGAAGAAGG 650 

TGGACGACAT AAGCCTTTCT TCACAGGATA TAG AC CTC AA TTCTTCTTCC 7 00 

GTACAACAGA CGTTACAGGT GTGGTAACTC TGCCTGAGGG AGTTGAGATG 750 

GTCATGCCTG GGGATAACGT TGAGTTTGAA GTGCAGTTGA TTAGCCCTGT 800 

60 GGCTTTAGAA GAAGGTATGA GA 822 

218 



CA 02307010 2000-05-19 



2 ) INFORMATION FOR SEQ ID NO : 23 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Chryseobacterium meningoseptlcum 

(B) STRAIN: CDC B7681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 3 

2 0 CGGAGCTATC TTAGTATGTG CTGCTACAGA TGGTCCAATG CCTCAAACTA 50 

GAGAACACAT CCTACTTTGC CGTCAGGTAA ACGTAC CT AG AATTGTTGTG 100 

TTCATGAACA AAGTTGACAT GGTAGATGAT CCAGAATTGT TAGAGCTTGT 150 

TGAGCTTGAA CTTAGAGATC TATTATCTAC TTACGAATAT GATGGTGATA 200 

ACTCTCCAGT AATTCAAGGT TCTGCTCTTG GTGCTCTTAA CGGTGATGCT 250 

2 5 AAGTGGGTAG CTACTGTAGA AGCTCTAATG GATGCTGTTG ATACTTGGAT 3 00 

CGAGCAACCA GTAAGAGATT CTGATAAGCC ATTCCTTATG CCAATCGAAG 3 50 

ACGTATTCTC TATTACAGGT AG AGGTAC TG TAGCAACTGG TAGAATCGAG 400 

GCTGGTGTAA TCAACACAGG TGATCCTGTT GACATCGTAG GTATGGGTGA 450 

CGAGAAGTTA ACTTCTACTA TTACAGGTGT TGAGATGTTT AGAAAAATCC 500 

3 0 TAGACAGAGG TGAAGCTGGT GATAACGTAG GTCTATTGTT GAGAGGTATT 550 

GAAAAGACTG ACATCAAGAG AGGTATGGTT ATCGCTAAGA AAGATTCAGT 600 

TAAGCCACAC AAGAAATTCA AAGCTGAGGT TTATATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCCACAACA AATACCGTCC TCAGTTCTAT 700 

GTAAGAACTA C TG ACGTTAC AGGTGAAATC TTCTTACCAG AAGGTGTAGA 7 50 

3 5 AATGGTAATG CCTGGTGATA ACTTAACTAT CACTGTAGAA TTGTTACAAC 800 

CAATCGCTCT TAACGAGGGT CTTAGATTCG CGATC 83 5 



40 2 ) INFORMATION FOR SEQ ID NO : 24 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: CI trobacter amalonatlcus 

(B) STRAIN: ATCC 25405 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

60 ACACCCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 
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GAGTGGGAAG CGAAAATCAT CGAACTGGCC GGCTTCCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

5 GACTGCCAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCWCCATCAA 600 

GCCGCACACC ATGTTCGAAT CYGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

10 CGTACAACTG ACGTG AC TGG CACCATCGAA CTGCCGGAAG GCGTTGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGG 816 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO : 25 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

CO STRANDEDNESS : Double 
( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrohacter braakii 

(B) STRAIN: ATCC 43162 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 5 



CGCGATCCTG GTTGTTGCTG CAACTGACGG CCCGATGCCG CAGACTCGTG 50 

AGCACATCCT GCTGGGTCGY CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AAC TGGT AG A 15 0 

3 5 AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGATTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGAWGCAGAG 25 0 

TGGGAAGCGA AAATCATCGA AC TGGCTGGC TTCCTGGATT CTTACATCCC 300 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCT ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

40 GGTATCATCA AAGTTGGTGA AGAAGTTGAA ATCGTTGGTA TCAARGACAC 45 0 

TGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCT CTATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GACGAAGGCG 650 

45 GCCGTCATAC TCCGTTCTTC AARGGCTACC GTCCGCAGTT CTACTTCCGT 700 

AC T ACTGAC G TGAC TGGTAC C ATCGAAC TG CCGGAAGGCG TTGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCAATCG 800 

CGATGGACGA CGGTCTGCGT TTCGC 82 5 



2 ) INFORMATION FOR SEQ ID NO : 2 6 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter koseri 

(B) STRAIN: ATCC 27156 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 5 0 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAAC TGGT 15 0 

TGAGATGGAA GTGCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCT 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATYGTTG GTATCAAAGA 450 

GACTGCGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GYTCCATCAA 600 

2 0 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATYCTGTCY AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTG AC TGG CACCATCGAA CTGCCGGAAG GCG TAG AGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGAGGAC GACGGTCTGC GTTTCGCAA 829 

25 



2 ) INFORMATION FOR SEQ . ID NO: 27 

3 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter farmer! 
40 (B) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 5 0 

45 AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTC CGT AC AT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AAC TGGTAGA 150 

GATGGAAGTT CGTGAACTGC TGTCTCAGTA CGATTTCCCG GGC G AC G AC A 2 00 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCAGGC TTCCTGGATT CTTACATCCC 300 

50 GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCTGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCATCA AAGTGGGTGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 

TGCCAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

5 5 GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCW CCATCAAGCC 600 

RC AC AC TATG TTCGAATCTG AAGTGTACAT TC TGTC C AAA GACGAAGGCG 65 0 

GCCGTCATAC TCCGTTC TTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 7 00 

ACGACTGACG TGACTGG CAC CATCGAACTG CCGGAAGGTG TTGAGATGGT 750 

TATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CACCCGATCG 800 

60 CGATGGACGA CGGTCTGCGT TTCGCAA 827 
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2) INFORMATION FOR SEQ ID NO: 28 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Citrobacter freundli 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

20 CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 15 0 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACTCCGA 2 00 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGAAGC AGAGTGGGAA 250 

2 5 GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCAGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCTATCGAA GACGTATTCT 3 50 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTGCTAA 45 0 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

3 0 GTGCTGGTGA GAACGTTGGT GTTCTGCTGC GTGGTATCAA AC GTG AAGAA 550 

ATCGAACGTG . GTCAGGTACT GGCTAAGCCG GGCTCTATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGACGAA GGCGGCCGTC 65 0 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 700 

G AC GTG AC TG GTACCATCGA ACTGCCGGAA GGC GT AG AG A TGGTAATGCC 750 

3 5 GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCGA 797 



40 



2) INFORMATION FOR SEQ ID NO : 2 9 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Citrobacter sedlakll 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

55 CGGCGCGATC CTGGTTGTTG CCGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAAC TGGT 150 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACGCCGAT CGTTCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

60 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 
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TCCGGAACCA GAGCGTGCGA TTGACAAGCC 
ACGTATTCTC CATCTCCGGT CGTGGTACCG 
CGCGGTATCA TCAAAGTGGG CGAAGAAGTT 
GACTGCGAAG TCTACCTGTA CTGGCGTTGA 
5 ACGAAGGCCG TGCGGGTGAG AACGTAGGTG 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG 
GCCGCACACC AAGTTCGAAT CTGAAGTGTA 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT 
CGTACAACTG ACGTGACTGG CACCATCGAA 
10 GGTAATGCCG GGCGACAACA TCAAAATGGT 
TCGCGATGGA CGACGGTCTG CGTTTC 



GTTCCTGCTG CCGATCGAAG 3 50 

TTGTTACCGG TCGTGTAGAG 400 

GAAATCGTTG GTATCAAAGA 450 

AATGTTCCGC AAACTGCTGG 500 

TTCTGCTGCG TGG T ATC AAA 550 

GCGAAGCCGG GCACCATCAA 600 

TATTCTGTCC AAAGATGAAG 650 

ACCGTCCGCA GTTCTACTTC 7 00 

CTGCCGGAAG GCGTAGAGAT 750 

TGTTACCCTG ATCCACCCGA 800 

826 



15 2 ) INFORMATION FOR SEQ ID NO: 30 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



GCGATCCTGG TTGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 15 0 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GATTTCCCGG GCGACGACAC 2 00 

35 TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GAAGCAGAGT 2 50 

GGGAAGCGAA AATCATCGAA CTGGCTGGCT TTCTG G ATTC TTACATCCCG 3 00 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTRCCTA TCGAAGACGT 350 

ATTC TCC ATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 4 00 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT C AAAG AC AC C 45 0 

40 GCTAAGTCTA CCTGTACCGG CGTTGAAATG TTCCGCAAAC TGCTGG AC G A 500 

AGGCCGTGCT GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCTC TATCAAGCCG 600 

C AC AC C AAGT TCGAATCTGA AGTGTACATC CTGTCCAAAG ACGAAGGCGG 650 

CCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 7 00 

45 CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA 75 0 

ATGCCGGGCG ACAACATYAA AATGGTTGTT ACYCTGATCC ACCCGATCGC 800 

GATGGACGAC GGTCTGCGTT TCG 823 



50 

2 ) INFORMATION FOR SEQ ID NO : 31 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 826 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 

223 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrohacter youngae 

(B) STRAIN: ATCC 29935 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 5 0 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 15 0 

10 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 2 00 

TACGCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCAG 2 50 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCGGAACCAG AACGTGCTAT CGATAAGCCG TTCCTGCTGC CAATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACTGGT CGTGTAGAAC 400 

15 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 45 0 

ACTGCCAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 5 00 

CGAAGGCCGT GC TGGTG AG A ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 5 50 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTC TGTCC A AAGACGAAGG 650 

20 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 7 00 

GTACTACTGA CGTGACGGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCGATGGAT GACGGTCTGC GTTTCG 82 6 



25 
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2 ) INFORMATION FOR SEQ ID NO : 32 



( i ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 { ii ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



CGGAGCTATA TTAGTTTGTT CAGCAGCTGA TGGTCCAATG CCTCAAACAA 50 

GAGAGCACAT CTTATTATCA TCAAGAGTTG GAGTTGACCA CATCGTAGTA 100 

45 TTCTTAAACA AAGCAGATAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTAGAGAGT TATTAAGCGA GTACAACTTC CCAGGAGACG 2 00 

AYATTCCAGT AATCAARGGA TCAGCTTTAG TAGCATTAGA AAACCCAACT 25 0 

GACGAAGCTG CAACAGCTTG TATCAGAGAG TTAATGGATG CTGTAGATAG 300 

CTACATCCCA AC AC C AG AAA GAGCAACAGA TAAGCCATTC TTAATGC C AG 3 50 

50 TAGAGGACGT ATTCACAATC ACTGGTAGAG GAACAGTTGC AACAGGAAGA 400 

GTTGAAAGAG GAGTTCTACA TGTAGGAGAC GAAGTAGAAG TAATCGGATT 450 

AACTGAAGAA AGAAGAAAAA CTGTTGTAAC AGGAATCGAA ATGTTCAGAA 500 

AGTTATTAGA TGAAGCACAA GC TGG AG AT A ACATCGGAGC ATTATTAAGA 550 

GGTATCCAAA GAACTGAYAT CGAAAGAGGT CAAGTTTTAG CTCAAGTTGG 600 

55 AACAATCAAC C CAC AC AAAA AATTCGTAGG TCAAGTATAC GTACTTAAAA 65 0 

AAGAAGAAGG TGGAAGACAT ACTCCATTCT TCGATGGATA CAGACCACAA 700 

TTCTACTTCA GAACAACAGA CGTTACAGGA TCAATCAAAT TACCAGAAGG 750 

AATGGAAATG GTTATGCCTG GAGACCACAT CGACATGGAA GTTGAATTAA 800 

TCACAGAAAT CGCTATGGAY GAAGGATTAA GATTCGCTAT C 841 

60 
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2 ) INFORMATION FOR SEQ ID NO : 33 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 
15 (B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 33 

CGGCGCCATC CTGGTGTGCT CGGCCGCTGA CGGCCCCATG CCCCAGACCC 50 

2 0 GCGAGCACAT CCTGCTGGCC CGTCAGGTGG GCGTGCCCTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTGGACGAC GAAGAGCTGC TGGAACTGGT 150 

CGAAATGGAA GTGCGCGAGC TGCTTGCCAA GTACGACTTC CCCGGCGACG 2 00 

ACACCCCCAT CATCCGCGGC TCGGCCAAGC TGGCCCTGGA AGGCGACCAG 250 

TCCGACAAGG GCGAACCTGC CATCCTGCGC CTGGCTGAAG CACTGGACTC 300 

2 5 CTACATCCCC ACGCCCGAGC GCGCTGTGGA CGGCGCCTTT GCAATGCCCG 3 50 

TGGAAGACGT GTTCTCGATC TCTGGCCGTG GCACCGTGGT GACTGGCCGT 400 

ATCGAGC GCG GCATCATCAA GGTCGGCGAA GAAATCGAAA TCGTCGGTAT 450 

CCGCGACACC CAGAAGACCA TCGTC AC CGG CGTGGAAATG TTCCGCAAGC 500 

TGCTGGACCA AGGTCAAGCT GGCGACAACG TGGGTCTGCT GCTGCGCGGC 550 

3 0 ACCAAGCGTG AAGACGTGGA ACGCGGCCAA GTGCTGTGCA. AGCCCGGCTC 6Q0 

CATCAAGCCC CACACCCACT TCACGGCTGA GGTGTAC GTG CTGTCCAAGG 650 

ACGAAGGTGG TCGCCACACT CCGTTCTTCA ACAACTACCG TCCCCAGTTC 7 00 

TATTTCCGTA CGACCGACGT GACCGGCTCC ATCGAGCTGC CCGCCGACAA 7 50 

GGAAATGGTG ATGCCTGGCG ACAACGTGTC GATCACCGTC AAGCTGATCG 800 

3 5 CCCCCATCGC CATGGAAGAA GG 822 



40 



2 ) INFORMATION FOR SEQ ID NO : 34 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 02 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: CoryneJbacterium bovis 

(B) STRAIN: ATCC 7715 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

55 GCCGCAGACC CGTGAGCACG TCCTCCTGGC CCGTCAGGTC GGTGTGCCCT 50 

ACATCCTCGT CGCCCTCAAC AAGTGCGACA TGGTCGACGA CGAGGACCTC 100 

ATCGAGCTCG TCGAGATGGA GGTCCGTGAG CTCCTCGCCG AGCAGGACTA 150 

CGACGAGGAC GCCCCGATCA TCCACATCTC CGCCCTCAAG GCCCTCGAGG 2 00 

GTG AC CCGGA GTGGACGCAG CGCATCGTCG AC CTC ATGAA GGCCTGCGAC 250 

60 GACGCCATCC CGGATCCGGA GCGCGAGACG GACAAGCCGT TCCTCATGCC 300 
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GATCGAGGAC ATCTTCACGA TCACCGGCCG CGGCACCGTC GTCACGGGCC 3 50 

GTGTCGAGCG TGGCATCCTC AACGTCAACG AGGAGGTCGA GATCCTGGGT 40 0 

ATCTGCGAGA ACTCCCAGAA GACGACCGTC ACCTCCATCG AGATGTTCAA 45 0 

CAAGTTCCTC GACACGGCCG AGGCCGGCGA CAACGCCGCC CTGCTGCTCC 500 

5 GTGGCCTGAA GCGCGAGGAC GTCGAGCGTG GCCAGATCGT GGCCAAGCCG 550 

GGCGCCTACA CGCCGCACAC CGAGTTCGAG GGCTCCGTGT ACATCCTCTC 600 

CAAGGACGAG GGTGGCCGCC ACACGCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGGACGACC GACGTCACCG GCGTCGTCAA GCTGCCGGAG 7 00 

GG 702 

10 

2 ) INFORMATION FOR SEQ ID NO : 35 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TO POLOGY : Linear 



20 



(ii) MOLECULE TYPE; Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxynebacterlum cervicis 
25 (B) STRAIN: NCTC 10604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

GGCTCAGACC CGCGAGCACG TTCTGCTTGC TCGCCAGGTT GGCGTTCCGA 50 

30 CGATCCTGGT TGCCCTCAAC AAGGCCGATA TGGTCGACGA TGAGGAAATG .100 

CTGGAGCTCG TTGAGGAAGA GTGCCGCGAC CTGCTCGAGT CCCAGGACTT 15 0 

CGATCGTGAC GCCCCGATCA TCCAGGTTTC CGCGCTGAAG GCTCTCGAAG 2 00 

GTGATCCGCA GTGGGTTGCT AAGGTCGAGG AGCTCATGGA GGCAGTCGAC 25 0 

AC CTTCGTGC CGACTCCTGA GCGCGACATG GACAAGCCGT TCCTCATGCC 3 00 

3 5 GATCGAAGAC GTCTTCACCA TCACCGGCCG TGGCACCGTT GTTACCGGTC 350 

GTGTTGAGCG TGGCAAGCTC CCGATCAACT CTGAGGTTGA AATCCTCGGT 400 

ATCCGCGAAC CGCAGAAGAC CACCGTTACC GGTATCGAGA TGTTCCACAA 450 

GTCCATGGAT GAAGCATGGG CAGGCGAGAA CTGTGGTCTC CTCCTGCGTG 5 00 

GC AC CAAGCG CGATGAGGTT GAGCGCGGTC AGGTCGTTGC CGTTCCCGGT 550 

40 TCGATCACCC CGCACACCAA CTTCACCGGA CAGGTCTACA TCCTCAAGAA 600 

GGAAGAAGGC GGTCGTCACA ACCCGTTCTT CTCGAACTAC CGTCCGCAGT 650 

TCTACTTCCG CACCACGGAC GTGACCGGCG TCATCACCC 689 

45 

2 ) INFORMATION FOR SEQ ID NO : 3 6 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 804 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium flavescens 

(B) STRAIN: ATCC 10340 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 
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GGTTGTTGCT GCAACCGATG GTCCTATGCC GCAGACCCGC GAGCACGTTC 50 

TTCTGGCTCG CCAGGTTGGC GTTCCTTACA TCCTCGTTGC TCTTAACAAG 100 

TGCGACATGG TTGATGATGA GGAAATCATC GAGCTCGTTG AGATGGAAAT 150 

5 CCGCGAACTG CTCGCTGAGC. AGGACTACGA CGAGGATGCC CCCATCATCC 200 

ACATCTCCGC TCTCAAGGCT CTTGAGGGTG ACGAGAAGTG GGTACAGGCC 250 

ATCGTCGACC TCATGCAGGC CTGCGATGAC TCCATTCCGG ATCCGGAGCG 300 

CGAGACCGAC AAGCCCTTCC TCATGCCTAT CGAGGACATC TTCACCATCA 3 50 

CCGGCCGCGG TACCGTTGTT AC CGGCCGTG TTGAGCGTGG CGTTTTGAAG 400 

10 GTCAACGAGG ATGTTGAGAT CATCGGCATC AAGGAGAAGT CCATCTCCAC 450 

CACCGTTACC GGTATCGAAA TGTTCCGCAA GATGATGGAC TACACCGAGG 500 

CTGGCGACAA CTGTGGTCTG CTTCTGCGTG GTACCAAGCG TGAAGAGGTC 55 0 

GAGCGCGGCC AGGTTGTTAT CAAGCCGGGC GCCTACACCC CCCACACCAA 600 

GTTCGAGGGT TCCGTCTACG TCCTCAAGAA GGAAGAGGGC GGCCGCCACA 650 

15 CCCCGTTCAT GGACAACTAC CGTCCGCAGT TCTACTTCCG TACCACTGAC 7 00 

GTGACCGGCG TTGTTCACCT GCCTGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACGTT GATATGACCG TTGAGCTCAT CCAGCCCGTC GCTAGGATGA 8 00 

GGGC 804 



20 



35 



2 } INFORMATION FOR SEQ ID NO: 37 



( i ) SEQUENCE CHARACTERISTICS : 

2 5 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebetcterium kutscheri 

(B) STRAIN: ATCC 15677 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 



TGCCTCAGAC CCGTGAGCAC GTTCTTCTTG CTCGCCAGGT TGGCGTTCCT 50 

TACATCCTCG TTGCTCTTAA CAAGTGCGAC ATGGTTGACG ATGAGGAAAT 100 

40 CATCGAGCTC GTTGAGATGG AAGTTCGCGA GCTTCTTGCT GAGCAGGAGT 150 

ACGATGAAGA GGCTCCAATC ATCCACATCT CTGCTTTGAA GGCTCTTGAG 2 00 

GGCGACGAGA AGTGGACTCA GGCCATCATC GACCTCATGC AGGCTTGTGA 250 

TGACTCCATC CCAGATCCAG AGCGTGAGAC CGACAAGCCA TTCCTCATGC 300 

CTATCGAGGA TATCTTCACC ATCACCGGTC GTGGCACCGT TGTTACCGGT 3 50 

45 CGTGTTGAGC GCGGTTCCTT GAAGGTGAAT GAGGACGTCG AGATCATCGG 400 

CATCAAGGAG AAGTCCACCA CTACTACCGT TACCGGTATC GAAATGTTCC 45 0 

GTAAGCTTCT TGATTACACC GAAGCTGGCG ATAACTGTGG TCTGCTTCTT 500 

CGTGGTATCA AGCGCGAAGA CGTTGAGCGT GGTCAGGTTG TTGTTAAGCC 550 

AGGCGCTTAC ACACCTCACA CCGAGTTCGA GGGCTCTGTT TACGTTC TTT 600 

50 CCAAGGACGA GGGCGGCCGC CACACCCCAT TCTTCGACAA CTACCGTCCA 650 

CAGTTCTACT TCCGCACCAC TGACGTTACC GGTGTTGTGA AG 692 



55 2 ) INFORMATION FOR SEQ ID NO: 3 8 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium minutissimum 

(B) STRAIN: ATCC 23348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 



CCTGGTTGTT GCTGCAACCG ATGGCCCGAT GCCGCAGACC CGCGAGCACG 5 0 

TTCTTCTGGC CCGCCAGGTT GGCGTTCCGT ACATCCTCGT TGCACTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAAATC ATCGAGCTCG TTGAGATGGA 15 0 

GATCCGTGAG CTGCTCGCTG AGCAGGACTA CGACGAGGAA GCTCCGATCG 2 00 

15 TTCACATCTC CGCTCTGAAG GCTCTTGAGG GCGACGAGAA GTGGGCACAG 2 50 

TCCATCGTTG ACCTGATGCA GGCTTGCGAT GACTCCATCC CGGATCCGGA 300 

GCGCGAGCTG GACAAGCCGT TCCTGATGCC GATCGAGGAC ATCTTC AC C A 350 

TTACCGGCCG CGGTACCGTT GTTACCGGCC GTGTTGAGCG TGGCTCCCTG 400 

AACGTTAACG AGGACATCGA GATCATCGGT ATCAAGGACA AGTCCATGTC 4 50 

2 0 CACCACCGTT ACCGGTATCG AGATGTTCCG CAAGATGATG GACTACACCG 500 

AGGCTGGCGA CAACTGTGGT CTGCTTCTGC GTGGTACCAA GCGTSAASAG 55G 

GTTGAGCGTG GCCAGGTTTG CATCAAGCCG GGCGCTTACA CCCCGCACAC 600 

C AAG TTCGAG GGTTCCGTCT ACGTCCTGAA GAAGGAAGAG GGCGGCCGCC 65 0 

ACACCCCGTT CATGGACAAC TACCGTCCGC AGTTCTACTT CCGCACCACC 7 00 

2 5 GACGTCACCG GTGTCATCAA GCTGCCGGAG GGC AC C GAGA TGGTCATGCC 7 50 

GGGCGACAAC GTTGAGATGT CCGTAGAGCT GATCCAGCCG GTCGCTA 7 97 



30 2) INFORMATION FOR SEQ ID NO : 39 

(i " ) SEQUENCE ^CHARACTERISTICS : 

(A) LENGTH: 702 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium mycetoides 

(B) STRAIN: ATCC 21134 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 



GCCGCAGACC CGCGAGCACG TTCTTCTGGC CCGCCAGGTC GGCGTCCCCT 50 

ACATCCTCGT TGCGC TGAAC AAGTGCGACA TGGTTGATGA TGAGGAGATC 100 

ATCGAGCTCG TGGAGATGGA GGTCCGTGAG CTGCTCGGCG AGCAGGACTA 150 

CGACGAGGAC GCCCCCATCA TCCACATCTC CGCTCTGAAG GCTCTCGAGG 2 00 

50 GCGACGAGAA GTGGGTTCAG TCCGTGCTCG ACCTCATGCA GGCGTGCGAC 250 

GACTCCATCC CGGATCCGGT CCGCGAGACC GACCGCGACT TCCTGATGCC 3 00 

GATCGAGGAC ATCTTC AC C A TCTCCGGCCG CGGCACCGTG GTTACCGGTC 3 50 

GTGTGGAGCG CGGCGTGCTC AACCTCAACG ACGAGGTCGA GATCATCGGC 400 

ATCCGCGACA AGTCCCAGAA GACCACCGTC ACCTCCATCG AGATGTTCAA 450 

55 CAAGCTGCTC GATACCGCTG AGGCAGGCGA CAACGCGGCT CTGCTGCTCC 500 

GCGGTCTGAA GCGCGAGGAC GTCGAGCGTG GCCAGGTTGT CATCAAGCCG 550 

GGCGCCTACA CCCCGCACAC CAAGTTCGAG GGTTCCGTCT ACGTCCTGTC 600 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGCACCACC GACGTGACCG GTGTTGTGAA GCTGCCGGAG 7 00 

60 GG 702 

228 
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2 ) INFORMATION FOR SEQ ID NO: 40 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 674 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Corynebacterium pseudogenitaliuxn 

(B) STRAIN: ATCC 33038 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

2 0 GCTCGCCAGG TTGGCGTTCC TTACATCCTC GTTGCGCTGA ACAAGTGCGA 50 

CATGGTTGAT GATGAGGAAA TCATCGAGCT CGTTGAGATG GAGATCCGTG 100 

AGCTGCTCGC AGAGCAGGAT TACGATGAGG AAGCTCCTAT CGTTCACATC 150 

TCCGCTCTGA AGGCCCTCGA GGGCGATGAC AAGTGGGTAC AGTCCGTCGT 2 00 

TGATCTGATG GAAGCCTGCG ACAACTCCAT CCCGGATCCG GAGCGCGCTA 25 0 

25 CCGACCAGCC GTTCCTGATG CCTATCGAGG ACATCTTCAC CATTAC CGGC 300 

CGCGGTACCG TTGTTACCGG CCGTGTTGAG CGTGGCCGTC TGAACGTCAA 3 50 

CGAGGACGTT GAGATCATCG GTATCCAGGA GAAGTCCCAG ACCACCACCG 400 

TTACCGGTAT CGAGATGTTC CGCAAGATGA TGGAC TAC AC CGAGGCTGGC 450 

GACAACTGTG GTCTGCTTCT GCGTGGTACC AAGCGTGAGG ACGTTGAGCG 500 

3 0 TGGCCAGGTT GTTATCAAGC CGGGCGCTTA CACCCCGCAC ACCAAGTTCG 55 0 

AGGGCTCCGT CTACGTCCTG AAGAAGGAAG AGGGCGGCCG CCACACCCCG 600 
TTCATGAACA ACTACCGTCC GCAGTTCTAC TTCCGTACCA CGGACGTTAC 65 0 

CGGTGTTGTT CACCTGCCAG AGGG 674 



35 



50 



2 ) INFORMATION FOR SEQ ID NO: 41 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 694 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium renale 
<B) STRAIN: ATCC 19412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



TGCCTCAGAC CCGTGAGCAC GTTCTGCTTG CTCGTCAGGT CGGCGTTCCT 5 0 

TACATCCTCG TTGCACTGAA CAAGTGCGAC ATGGTCGACG ACGAAGAAAT 100 

55 CATCGAGCTC GTCGAGATGG AAATCCGTGA ACTGCTCGCA GAGCAGGACT 150 

ACGATGAGGA AGCTCCTATC GTTCACATCT CCGCTCTGGG CGCCCTGAAC 200 

GGCGAGCAGA AGTGGGTTGA CTCCATCGTC GAACTGATGG AAGCTTGCGA 2 50 

CAACTCCATC CCAGACCCAG TTCGCGACAT CGACCACCCA TTCCTGATGC 3 00 

CTATCGAGGA CATCTTCACC ATTACCGGTC GCGGTACCGT TGTTACCGGC 3 50 

60 CGTGTCGAGC GTGGCCGTCT CAACGTCAAC GAAGAAGTTG AGATCATCGG 400 
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TATCAAGGAC AAGTCCCAGA AGACCACCGT 
GCAAGATGCT GG AC T AC AC C GAAGCTGGCG 
CGCGGCATCG GCC GTGAGG A TGTCGAGCGT 
AGGCGCTTAC ACCCCTCACT CTGAGTTCGA 
5 CCAAGGACGA GGGTGGCCGC CACACCCCAT 
CAGTTCTACT TCCGCACCAC CGACGTGACC 



CACCGGTATC GAGATGTTCC 450 

ACAACTGTGG TCTGCTGCTC 500 

GGCCAGGTTA TCATCAAGCC 550 

GGGCTCTGTC TACGTCCTGT 600 

TCTTCGACAA CTACCGTCCA 650 

GGCGTTGTGC ACCT 694 



10 2 ) INFORMATION FOR SEQ ID NO: 42 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 687 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynsha. ct erium ul-cerans 

(B) STRAIN: NCTC 8665 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



GCCGCAGACC CGCGAGCACG TTCTGCTGGC TCGCCAGGTT GGCGTTCCKT 50 

ACATCCTSGT TGCACTGAAC AAGTGCGACA TGGTTGACGA TGAGGARCTC 100 

CTSGAGCTCG TCGAGATGGA GGTCCGCGAG CTGCTGGCTG AGCAGGACTA 150 

CGACGAGGAA GCTCCGRTCG TTCACATCTC CGCWCTGAAC GCCCTGGACG 200 

30 GCGACSAGAA GTGGGCTVAC TCCATCCTCG AGCTGATGCA GGCTTGCGAC 250 

-GAGTCCATCC -CGGATCCGGA GCGCGAGACC GACAAGCCGT TCCTGATGCC 300 

GATTGAGGAC ATCTTCACCA TTACCGGTCG CGGYACCGTT GTTACCGGCC 350 

GTGTTGAGCG TGGCDTCCTG AACGTSAACG ACGASGTTGA GATCATGGGY 400 

ATC CGGGAG A AGTCCCAGAA GACCACCGTY ACCKSCATCG AGATGTTCAA 450 

3 5 CAAGMTGMTG GACWCCGCAG AGGCTGGCGA CAACGCTGSW CTGCTGCTGC 500 

GTGGTMTSAA GCGTGAGGAC GTTGAGCGTG GCC AG ATC AT CGYTAAGCCG 550 

GGCGCKTACA CCCCGCACAC CGAGTTCGAG GGCTCCGTCT ACGTCCTGTC 600 

CAAGGACGAG GGCGGCCGCC ACACCCCGTT CTTCGACAAC TACCGTCCGC 650 

AGTTCTACTT CCGCACCACC GACGTSACCG GTGTTGT 687 

40 



2 ) INFORMATION FOR SEQ ID NO: 43 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticxim 
55 (B) STRAIN: ATCC 43 042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC GTGAGCACGT 50 
60 TCTGCTGGCT CGCCAGGTTG GCGTTCCGTA CATCCTCGTT GCACTGAACA 100 
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15 



AGTGCGACAT 
GTCCGCGAGC 
CCCGATCTCC 
CCATCCTCGA 
CGCGAGACCG 
TACCGGTCGC 
ACCTGAACGA 
ACCACCGTCA 
GGCTGGCGAC 
TCGAGCGAGG 
GAGTTCGAGG 
CACCCCGTTC 
ACGTCACCGG 
GGCGACAACG 



GGTTGACGAT 
TTC TGGCTG A 
GCACTGGGCG 
GCTCATGAAG 
ACAAGCCGTT 
GGCACCGTCG 
CGAGGTCGAG 
CCTCCATCGA 
AACGCTGCAC 
CCAGATCATC 
GCTCCGTCTA 
TTCGACAACT 
TGTCGTTACC 
TTGAGATGAG 



GAGGAGCTCC 
GCAGGACTAC 
CCCTGGACGG 
GCTTGCGACG 
CCTGATGCCG 
TTACCGGCCG 
ATCCTGGGCA 
GATGTTCAAC 
TGCTGCTGCG 
GCTAAGCCGG 
CGTCCTGTCC 
ACCGTCCGCA 
CTGCCAGAGG 
CGTCAAGC 



TCGAGCTCGT 
GACGAGGAGG 
CGATCAGAAG 
AGTCCATCCC 
GTTGAGGACA 
TGTTGAGCGT 
TCCGCGAGAA 
AAGCTGCTGG 
TGGTCTGAAG 
GCGCTTACAC 
AAGGACGAGG 
GTTCTACTTC 
GCACCGACAT 



CGAGATGGAG 150 

CTCCGGTCGT 2 00 

TGGGTCGACT 2 50 

GGACCCGGAG 300 

TCTTCACCAT 350 

GGCGTCCTGA 400 

GTCCACCAAG 450 

ACACCGCAGA 500 

CGTGAGGACG 550 

CCCGCACACC 600 

GCGGCCGTCA 65 0 

CGTACCACCG 7 00 

GGTCATGCCG 75 0 
778 



20 



25 



30 



35 



40 



45 



50 



2 ) INFORMATION FOR SEQ ID NO : 44 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 703 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium xerosis 

(B) STRAIN: ATCC 373 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 



CGCAGACCCG 
ATCCTCGTCG 
CGAGCTCGTG 
ACGAGGAGGC 
GAAGAGAAGT 
GAACGTTCCG 
TCGAGGACAT 
GTGGAGCGCG 
CCAGGAGAAG 
AGCTGCTGGA 
GGCATCAAGC 
CGCCTACACC 
AGGACGAGGG 
TTCTACTTCC 
CAC 



TGAGCACGTC 
CCCTGAACAA 
GAGATGGAGG 
CCCGATCGTG 
GGGTCG AC TC 
GACCCGGTCC 
CTTCACCATC 
GCACCCTGAA 
TCCCAGACCA 
CTCCGCCGAG 
GCGAGGACAT 
CCGCACACCG 
CGGCCGCCAC 
GCACCACCGA 



CTCCTGGCCC 
GTGCGACATG 
TGCGTGAGCT 
CACATCTCCG 
CATCGTCGAG 
GCGAGACCGA 
ACCGGCCGCG 
GGTCAACGAC 
CCACCGTCAC 
GCCGGCGACA 
CGAGCGCGGC 
AGTTCGAGGG 
ACCCCGTTCT 
CGTCACCGGC 



GCCAGGTCGG 
GTCGACGATG 
TCTCGCCGAG 
CCCTGGGCGC 
CTCATGAACG 
CAAGCCGTTC 
GCACCGTCGC 
GAGGTCGAGA 
CGGCATCGAG 
ACTGTGGCCT 
CAGATCATCG 
CTCCGTCTAC 
TCGACAACTA 
GTCGTGAAGC 



CGTCCCCTAC 50 

AGGAGATCAT 100 

CAGGACTACG 15 0 

CCTCAATGGC 200 

CCGTCGACGA 2 5 0 

CTGATGCCCG 3 00 

CACCGGTCGC 3 50 

TCCTGGGCAT 4 00 

ATGTTCCGCA 450 

GCTGCTCCGC 500 

CGAAGCCGGG 550 

ATCCTGGCCA 600 

CCGTCCGCAG 650 

TGCCGGAGGG 7 00 
703 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 45 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE; 



Genomic DNA 

231 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

GGAGCGATAT TGGTGGTGAG CGCAGCGGAC GGCCCGATGC CGCAAACGCG 50 

GGAACACATT GTATTGGCGA AG C AAGTGGG TGTTCCGAAC ATAGTGGTTT 100 

10 ACTTGAACAA AGCGGACATG GTGGATGACA AAGAGCTGTT GGAATTAGTG 15 0 

GAAATGGAAG TGAGGGATTT ATTGAACAGT TATGATTTCC CTGGGGATGA 2 00 

GACGCCGATA ATAGTGGGGT CAGCGTTAAA GGCGTTAGAA GGTGACAAGA 250 

GTGAGGTTGG GGAGCCATCG ATAATCAAAT TAGTGGAAAC GATGGACACG 3 00 

TACTTCCCGC AGCCGGAGCG AGCGATAGAC AAACCGTTTT TAATGCCGAT 350 

15 CGAAGATGTG TTTTCGATAT CGGGCCGAGG GACGGTGGTG ACGGGACGCG 40 0 

TAGAGCGAGG GATCATCAAA GTGGGC G AC G AGATAGAGAT TGTGGGGATC 450 

AAGGACACGA CGAAGACGAC GTGCACGGGC GTTGAGATGT TTCGCAAATT 500 

ATTGGATGAA GGTCAAGCGG GTGACAACGT AGGAATTTTA TTGAGAGGGA 55 0 

CGAAACGCGA AGAAGTGGAG CGTGGTCAAG TATTGGCGAA ACCGGGATCG 600 

20 ATCACGCCAC ACAAGAAATT TG AGGCG G AG ATTTATGTGT TGTCGAAGGA 65 0 

AGAAGGGGGA CGCCACACAC CGTTTTTACA AGGCTATCGA CCGCAATTTT 7 00 

ATTTCCGCAC GACGGACGTG ACGGGCCAGT TATTGAGTTT ACCGGAGGGG 750 

ATAGAGATGG TGATGCCGGG AGATAACGTG AAAGTGACGG TTGAATTGAT 800 

TGCGCCGGTA GCGATGGATG AAGGGCTACG AT 832 

25 



2 ) INFORMATION FOR SEQ ID NO: 46 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwards i el la hoshinae 
40 (B) STRAIN: ATCC 33379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

45 TGAGCACATC CTGCTGGGTC GCCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 2 00 

TACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 2 50 

AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACGCTGGA CTCCTACATT 300 

50 CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 350 

CGTATTCTCA ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTAGG TATCAAGCCG 450 

ACCACCAAGA CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 550 

55 GTGACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 600 

CCGCACACCA AGTTCGAATC AGAAGTGTAC ATCCTGAGCA AGGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGTTA CCGTCCGCAG TTCTACTTCC 7 00 

GTACCACTGA CGTGAC TGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 800 

60 CGCCATGGAC GATGGT 816 
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2 ) INFORMATION FOR SEQ ID NO : 47 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 
<B) TYPE: Nucleic acid 
(C) STRANDEDNESS: Double 

10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Edwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

20 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGTTGGGTC GCCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAGATGGAAG TTCGCGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACGCCGGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

25 AGTGGGAAGC GAAGATCATC GAACTGGCTG AAACTCTGGA CTCCTACATC 300 

CCGGAACCTG AGCGTGACAT CGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCT ATCTCTGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTAGGC GACGAAGTTG AAATCGTTGG TATCAAGCCG 450 

ACCACCAAGA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

3 0 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

~ GTG ACGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCACT 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGAGCA AGGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GT ACT AC TGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

3 5 GTAATGCCGG GCGACAACAT CAAGATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCCATGGAC GATGGTCTGC G 821 



40 2 ) INFORMATION FOR SEQ ID NO: 48 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
45 <C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23 834 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 



CGGTGCCATC CTGGTGGTAT CCGCTGCTGA CGGCCCCATG CCTCAGACTC 50 

GCGAACACAT CCTGTTGGCT CGTCAGGTAG GTGTACCCTA CATCCTCGTA 100 

TTCATGAACA AATGCGACAT GGTAGATGAT GCCGAGCTGC TTGAGTTGGT 150 

TGAGATGGAA ATCCGCGACC TGCTCTCCAG CTATGACTTC CCTGGTGACG 2 00 

60 ACTGCCCGAT CGTACAAGGT TCCGCTCTCA AAGCCCTCGA AGGCGATGCC 250 
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GGTTACAAAG AAAAAATCTT CGAACTAGCT 
CCCCACTCCT CAACGTGCTG TAGACAAACC 
ACGTATTCTC TATCTCCGGC CGTGGTACCG 
CGCGGCATCA TCAAAGTAGG TGAAGAGATC 
5 CACTCAGAAA ACTACCTGTA CTGGCGTGGA 
ACGAAGGTCA GGCCGGTGAC AACGTAGGCG 
CGTGAAGAAG TTGAGCGTGG TCAAGTATTG 
TCCGCACACC AAGTTCAAAG CCGAAGTATA 
GTGGTCGTCA CACCCCGTTC TTTGCCAACT 
10 CGTACTACTG ACGTAACCGG TGC TG TAG AG 
GGTTATGCCT GGTGAGAACG TAACCATCAC 
TTGCTATGGA AGAAGGTCTG CGCTTTGCGA 



GCTGCTTTGG ATAGCTACAT 3 00 

CTTCCTGTTG CCGATCGAAG 3 50 

TAGTAACCGG TCGTGTAGAG 400 

GAAATCGTTG GTCTGAAGCC 450 

AATGTTCCGC AAACTGCTGG 500 

TACTGCTGCG CGGTACCAAA 550 

GCTAAACCCG GCACCATCAC 600 

CGTATTGAGC AAAGAAGAAG 650 

ACCGTCCACA GTTCTACTTC 7 00 

CTGGAGCCTG GTGTAGAAAT 750 

CGTAGAACTG ATTGCTCCGA 800 

830 



15 

2) INFORMATION FOR SEQ ID NO : 49 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 808 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter aerogenes 

(B) STRAIN: ATCC 13048 



30 (xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 49 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 15 0 

35 GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGACGA 2 00 

CACTCCGATC GTTCGTGGTT C TGC TC TG AA AGCGCTGGAA GGCGACGCAG 2 50 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

CCRGAACCAG AGCGTGCGAT TGACAAGCCG TTCC TGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

40 GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGCGAAAA CCACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGYGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CAGCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCCA AAGACGAAGG 65 0 

45 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGRTG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGG ~ 808 

50 



2 ) INFORMATION FOR SEQ ID NO : 50 

( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGTGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 15 0 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

ATACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAMGCW 250 

GAGTGGGAAG CGAAAATCAT CGARCTGGCT GGCCACCTGG ATACCTATAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGCGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TYAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

TACYGCGAAA TCAACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 55 0 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 60 0 

2 0 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGTCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCSCA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 75 0 

GGTAATGC CG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTCGCA 82 8 

25 

2 ) INFORMATION FOR SEQ ID NO : 51 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 
40 (B) STRAIN: ATCC 33072 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACGC 50 

45 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 2 00 

ACACTCCAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 25 0 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCTTACAT 3 00 

50 CCCGGAACCA GAACGTGCTA TCG ATAAGC C ATTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC TATC TCCGGC CGTGGTACTG TTGTAACCGG TCGTGTAGAG 4 00 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 55 0 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG G CTAAGC C AG GCTCAATCAA 600 

GCCGCACACC AAATTCGAAT CTGAAGTTTA TATTCTGTCC AAAGATGAAG 65 0 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACCGG CACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACA TTCAGATGGT TGTTACCCTG ATCCACCCAA 80 0 

60 TCGCGATGGA TGACGGTCTG CGTTT 82 5 
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2 ) INFORMATION FOR SEQ ID NO : 52 

5 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: En terobacter asburiae 

(B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 

2 0 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTT CATQATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 15 0 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ATACTCCAAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCGA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

3 0 ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCGAAGCCAG GCTCAATCAA 600 

GCCACACACC AAGTTCGAAT C TGAAGTGT A CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

35 GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTCTG CG 822 



40 2 ) INFORMATION FOR SEQ ID NO: 53 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 

(B) STRAIN: ATCC 35317 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 53 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCAGGCGACG 200 

60 ACACTCCAAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAAGCT 250 
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GAGTGGGAAG CAAAAATCAT CGAACTGGCT 
CCCAGAACCA GAGCGTGCGA TTGACAAGCC 
ACGTATTCTC CATCTCCGGT CGTGGTACCG 
CGC GGTATC A TCAAAGTTGG TGAAGAAGTT 
5 TACTGCKAAA TCTACCTGTA CTGGCGTTGA 
ACGAAGGCCG TGCTGGTGAG AACGTTGGTG 
CGCGAAGAAA TCGAACGTGG TCAGGTTCTG 
GCCACACACC AAGTTCGAAT CTGAAGTGTA 
GCGGCCGTCA TACTCCGTTC TTCAAAGGCT 
10 CGTACAACTG ACGTGACCGG TAC CATC GAA 
GGTAATGCCA GGCGACAACA TCAAGATGGT 
TCGCGATGGA CGACGGTCTG CGTTTC 



GGCTTCCTGG ATTCTTACAT 3 00 

ATTCCTGCTG CCAATCGAAG 350 

TTGTTACCGG TCGTGTAGAG 400 

GAAATCGTTG GTATCAAAGA 450 

AATGTTC CGC AAACTGCTGG 500 

TTCTGCTGCG TGGTATCAAA 5 50 

GCGAAGCCAG GCTCAATCAA 600 

CATCCTGTCC AAAGACGAAG 650 

ACCGTCCACA GTTCTACTTC 7 00 

CTGCCAGAAG GCGTAGAGAT 7 50 

TGTGACGCTG ATCCACCCAA 800 

826 



15 



25 



50 



2 ) INFORMATION FOR SEQ ID NO: 54 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 806 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobac ter cloacae 

(B) STRAIN: ATCC 13 047 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 

GATCCTGGTA GTAGCTGCGA CTGACGGCCC AATGCCTCAG ACTCGTGAGC 50 

ACATCCTGCT GGG TCGTC AG GTAGGCGTTC CTTACATCAT CGTGTTCCTG 100 

AACAAATGCG ACATGGTTGA TGACGAAGAG CTGCTGGAAC TGGTAGAGAT 150 

3 5 GGAAGTTCGT GAACTGCTGT CTCAGTACGA TTTCCCAGGC GACGATACCC 2 00 

CAATCGTTCG TGGTTCTGCT CTGAAAGCGC TGGAAGGCGA CGCAGAGTGG 2 50 

GAAGMGAAAA TCATCGAACT GGCTGGCTAC CTGGATTCTT ACATCCCAGA 3 00 

AC C AGAGCGT GCGATTGAYA AGCCATTCCT GCTGCCAATC GAAGACGTAT 3 50 

TCTCCATCTC CGGTCGTGGT ACCGTTGTTA CCGGTCGTGT AGAGCGCGGT 400 

40 ATCATCAAAG TGGGTGAAGA AGTTGAAATC GTTGGTATCA AAGAGACTGC 450 

GAAGTCTACC TGTACTGGCG TTGAAATGTT CCGCAAACTG CTGGACGAAG 500 

GCCGTGCTGG TGAGAACGTT GGTGTTCTGC TGCGTGGTAT CAAACGTGAA 5 50 

GAAATCGAAC GTGGTCAGGT TCTGGCGAAG CCAGGCTCAA TCAAGCCACA 600 

CACCAAGTTC GAATCTGAAG TGTACATCCT GTCCAAAGAC GAAGGCGGCC 6 50 

45 GTCATACTCC GTTC TTC AAA GGCTACCGTC CACAGTTCTA CTTCCGTACA 7 00 

ACTGACGTGA CCGGTACCAT CGAACTGCCA GAAGGCGTAG AGGTGGTAAT 7 50 

GCCAGGCGAC AACATCAAGA TGGTTGTGAC TCTGATCCAC CCAATCGCGA 8 00 

TGGACG 806 



2 ) INFORMATION FOR SEQ ID NO : 55 



( i } SEQUENCE CHARACTERISTICS : 

5 5 (A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

6 0 (ii) MOLECULE TYPE: Genomic DNA 

237 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergovlae 

(B) STRAIN: ATCC 3 3 028 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AGTGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 15 0 

AGAGATGGAA GTTCGTGAAC TGCTGTCTCA GTACGATTTC CCGGGCGACG 2 00 

ACACCCCGAT CGTTCGCGGT TCTGCGCTGA AAGCGCTGGA AGGCGACGCA 2 50 

GAGTGGGAAG C GAAAATC AT CGAACTGGCT GGCCACCTGG ATACCTAYAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATTTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCA TCAAGGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 45 0 

CACCGCGAAA ACCACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTCGGCG TTCTGCTGCG TGGTATCAAG 55 0 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCCATCAA 600 

20 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATCCTGTCC AAAGAC G AAG 650 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACGGTCGGCA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 75 0 

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTC 82 6 

25 



2 ) INFORMATION FOR SEQ ID NO : 56 

3 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 
40 (B) STRAIN: ATCC 49162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

45 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAG AGCTGC T GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CAGGCGACGA 2 00 

CACCCCAATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAMGCAG 250 

AGTGGGAAGM GAAAATCATC GARCTGGCTG GCTTCCTGGA TTCTTACATC 300 

50 CCAGAACCAG AGCGTGCGAT TGACAAGCCA TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTWGAGC 4 00 

GCGGTATCAT CAAAGTAGGT GAAGAAGTTG AAATC GTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

55 GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CGAAGCCAGG CTCAATCAAG 600 

CCACACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACAT CAAGATGGTT GTGACGCTGA TCCACCCAAT 800 

60 CGCGATGGAC GACGGTCTGC GTTTCGCAA 82 9 

238 
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2 } INFORMATION FOR SEQ ID NO : 57 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: En terobacter sakazakii 

(B) STRAIN: ATCC 29544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 

2 0 GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 15 0 

GAGATGGAAG TGCGCGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACCCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCTG 2 50 

25 AGTGGGAAGC GAAAATCATC GAGCTGGCAG GTC AC CTGGA TTCCTACATC 3 00 

CCGGAACCGG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCYGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAGGTTGGT GAAGAAGTTG AAATCGTGGG CATCAAAGAC 450 

AC CGCGAAAT CCACCTGTAC CGGCGTTGAA ATGTTCCG CA AACTGCTGGA 5 00 

30 . CGAAGGCCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC . -550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCCATCAAG 6 00 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTAC RACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT C 831 



40 2 ) INFORMATION FOR SEQ ID NO : 58 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

{ ii ) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 2 5788 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 



CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATATGACTTC CCAGGCGACG 2 00 

60 ATGTTCCTGT AATCGCTGGT TCTGCTTTGA AAGCTCYTGA AGGCGATGCT 2 50 

239 
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TCATACGAAG AAAAAATCAT GGAATTAATG 
TCCAACTCCA GAACGTGACA CTGACAAACC 
ACGTATTCTC AATCACTGGA CGTGGTACTG 
CGTGGACAAG TTCGCGTTGG TGACGAAGTT 
5 AGAAACTGCT AAAACAACTG TAACTGGTGT 
TAGACTATGC TGAAGCAGGG GATAACATTG 
GCTCGTGAAG AC AT C C AACG TGGACAAGTA 
CACACCTCAT ACAAAATTTA AAGCTGAAGT 
AAGGTGGACG TCACACACCA TTCTTCACTA 
10 TTCCGTACAA CTGACGTAAC TGGTGTTGTT 
AATGGTTATG CCTGGTGATA ACGTAACAAT 
CAATCGCTAT CGAAGACGGA ACTCGTTTCT 



GCTGCAGTTG ACGAATACGT 300 

ATTCATGATG CCAGTCGAAG 350 

TTGCTACAGG CCGTGTTGAA 400 

GAAATCGTTG GTATTGCTGA 450 

TGAAATGTTC CGTAAATTGT 500 

GTGCATTGCT ACGTGGTGTT 550 

TTGGCTAAAG CTGGTACAAT 600 

TTACGTTTTA ACAAAAGAAG 650 

ACTACCGTCC TCAGTTCTAC 7 00 

GAATTACCAG AAGGAACTGA 7 50 

CGACGTTGAA TTGATCCACC 800 

CAATT 835 



15 

2 ) INFORMATION FOR SEQ ID NO: 59 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 826 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 59 



GGTGC TATCT TAGTAGTATC TGCTGCTGAT GGTCCTATGC CACAAACTCG 50 

TGAACACATT CTTTTATCAC GTAACGTTGG TGTTC CAT AC ATCGTTGTTT 100 

TCTTAAACAA AGTTGATATG GTTGACGACG AAGAATTATT AGAATTAGTT 150 

3 5 GAAATGGAAG TACGTGACTT ATTAACTGAA TACGACTTCC CAGGAGACGA 2 00 

TGTTC CTGTA ATCGCTGGTT CTGCATTAAA AGCTTTAGAA GGCGACCCAT 250 

CTTACGAAGA AAAAATCTTA GAATTAATGG CTGCAGTTGA CGAATACATC 3 00 

CCAACTCCAG AACGTGACAA CGATAAACCA TTCATGATGC CAGTCGAAGA 3 50 

CGTATTTTCA ATC AC TGGTC GTGGTACTGT TGCTACAGGT CGTGTTGAAC 4 00 

40 GTGGACAAGT ACGTGTTGGT GACGAAGTTG AAATAGTTGG TATCCATGAT 450 

GAAATTTCTA AAACAACAGT T AC TGGTGTT GAAATGTTCC GTAAATTATT 500 

AGATTACGCT GAAGCTGGAG ACAACATCGG TGCATTATTA CGTGGTGTGG 550 

CTCGTGAAGA TATCCAACGT GGTCAAGTAT TAGCTAAACC AGGTTCAATC 600 

ACTCCACATA CAAAATTCAC TGCTGAAGTG TACGTTTTAA C T AAAGAAG A 650 

45 AGGTGGACGT CATACTCCAT TCTTCACTAA CTACCGTCCA CAATTCTACT 7 00 

TCCGTACAAC TGACGTTACA GGTGTAGTTA ACTTACCAGA AGGTACTGAA 7 50 

ATGGTTATGC CTGGTGATAA CGTAAC TATG GAAGTTGAAT TAATCCACCC 800 

AATCGC T ATC GAAGACGGAA CTCGTT 82 6 



50 



2 ) INFORMATION FOR SEQ ID NO : 60 

( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 

240 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 

CGGCGCGATC TTGGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT C C TATTGTC A CGTAACGTTG GTGTTCCTTA CATCGTCGTT 100 

10 TTCTTGAACA AAATGGACAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTGTCAGA ATACG AC TTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGCCTTAGA AGGCGACGCT 2 50 

TCATATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GTTCGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

15 ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCAACTGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAATCGTAG GTATCGCTGA 45 0 

AGAAACTGCT AAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TGGATTACGC TGAAGCTGGC GACAACATTG GTGCATTATT ACGTGGTGTG 550 

GCTCGTGAAG ATATCCAACG TGGTCAAGTA TTATCAAAAC CAGGTTCAAT 600 

2 0 CACTCCACAT ACAAAATTTG CGGCAGAAGT TTACGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC AC AATTC T AC 7 00 

TTCCGTACAA C TG ACGT AAC AGGTGTTGTT GAATTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTTACTAT GGACGTTGAA TTAATCCACC 800 

. CAATCGCGAT CGAAGACGGT ACTCGTTTCT CAATC 835 



2 ) INFORMATION FOR SEQ ID NO : 61 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases - 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS : Double 
( D ) TOPOLOGY : Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
40 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 50 

45 GTGAACATAT CCTATTATCT CGTCAAGTTG GTGTTCCTTA CATCGTYGTA 100 

TTCTTGAACA AAGTAGATAT GGTCGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AATCGCTGGT TCAGCTTTGA AAGC TTT AG A AGGCGACGCT 250 

TCATACGAAG AAAAAATCCT TGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

50 CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTTGAAG 3 50 

ATGTATTCTC RATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGRCTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 550 

5 5 GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACKCCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC AC AATTC T AC 7 00 

TTCCGTACAA C TGACGT AAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

6 0 CAATCGCTAT CGAAAATGGT AC T AAATTC T CAATC 835 

241 
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2) INFORMATION FOR SEQ ID NO: 62 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 



2 0 AGTAGTTTCT 
TATTATC AC G 
ATGGATATGG 
TCGTGACTTA 
TCGCAGGTTC 

2 5 AAAATC TT AG 
ACGTGATACT 
TCACTGGACG 
CGCGTTGGTG 
AACAACTGTT 

30 AAGCAGGCGA 
ATCGAACGTG 
* AAAATTCAAA 
ACACTCCATT 

35 



GCTGCTGATG 
TAACGTTGGT 
TTGATGACGA 
TTATCAGAAT 
TGC TTTG AAA 
AATTAATGGC 
GACAAACCAT 
TGGTACTGTT 
ACGAAGTTGA 
ACAGGTGTTG 
CAACATCGGT 
GACAAGTATT 
GC TGAAGT AT 
CTTCACTAAC 



GTCCTATGCC 
GTAC CAT AC A 
AGAATTATTA 
ACGATTTCCC 
GCTTTAGAAG 
TGCAGTTGAC 
TCATGATGCC 
GCTACAGGCC 
AATCGTTGGT 
AAATGTTCCG 
GCTTTATTAC 
AGCTAAACCA 
ACGTATTATC 
TACCGTCCTC 



TCAAACACGT 
TCGTTGTATT 
GAATTAGTAG 
AGGCGATGAT 
GCGACGAGTC 
GAATATATCC 
AGTCGAAGAC 
GTGTTGAACG 
ATTAAAGACG 
TAAATTATTA 
GTGGTGTAGC 
GCTACAATCA 
AAAAGAAGAA 



GAACATATCT 50 

CTTAAACAAA 10 u 

AAATGGAAGT 15 0 
GTTCCAGTTA ■ 200 

TTATGAAGAA 2 50 

CAACTCCAGA 3 00 

GTATTCTCAA 3 50 

TGGTGAAGTT 400 

AAACATCTAA 450 

GACTACGCTG 5 00 

AC GTG AAG AT 550 

CTCCACACAC 600 

GGCGGACGTC 65 0 
680 



40 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 63 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 680 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R487 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



AGTAGTTTCT 
TATTATC AC G 

55 ATGGATATGG 
TCGTGACTTA 
TCGCAGGTTC 
AAAATCTTAG 
ACGTGATACT 

60 TCACTGGACG 



GCTGCTGATG 
TAACGTTGGT 
TTGATGACGA 
TTATCAGAAT 
TGCTTTGAAA 
AATTAATGGC 
GACAAACCAT 
TGGTACTGTT 



GTCCTATGCC TCAAACACGT 
GTAC CAT AC A TCGTTGTATT 
AGAATTATTA GAATTAGTAG 
ACGATTTCCC AGGCGATGAT 
GCTTTAGAAG GCGACGAGTC 
TGCAGTTGAC GAATATATCC 
TCATGATGCC AGTCGAAGAC 
GCTACAGGCC GTGTTGAACG 

242 



GAACATATCT 50 

C TT AAAC AAA 100 

AAATGGAAGT 150 

GTTCCAGTTA 2 00 

TTATGAAGAA 250 

CAACTCCAGA 3 00 

GTATTCTCAA 3 50 

TGGTGAAGTT 400 
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CGCGTTGGTG ACGAAGTTGA AATCGTTGGT 
AACAACTGTT AC AGGTG TTG AAATGTTCCG 
AAGCAGGCGA CAACATCGGT GCTTTATTAC 
ATCGAACGTG GACAAGTATT AGCTAAACCA 
5 AAAATTCAAA GCTGAAGTAT AC GT ATTATC 
ACACTCCATT CTTCACTAAC TACCGTCCTC 



ATTAAAGACG AAACATCTAA 450 
TAAATTATTA GACTACGCTG 500 
GTGGTGTAGC ACGTGAAGAT 550 
GCTACAATCA CTCCACACAC 600 
AAAAGAAGAA GGCGGACGTC 650 

680 



10 2 ) INFORMATION FOR SEQ ID NO: 64 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faeclum 

(B) STRAIN: R4 82 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 



AGTAGTTTCT GCTGCTGACG GCCCAATGCC TCAAACTCGT GAACACATCC 50 

TATTGTCTCG TCAAGTTGGT GTTCCTTACA TCGTTGTATT CTTGAACAAA 100 

GTAGACATGG TTGATGACGA AGAATTACTA GAATTAGTTG AAATGGAAGT 150 

TCGTGACCTA TTAACAGAAT ACGAATTCCC TGGTGACGAT GTTCCTGTAG 2 00 

3 0 TTGCTGGATC AGC TTTG AAA GCTCTAGAAG GCGACGCT.TC ATACGAAGAA 25 0 

AAAATTC TTG AATTAATGGC - TGC AGTTGAC GAATACATCC CAACTCCAGA 3 00 

ACGTGACAAC GACAAACCAT TCATGATGCC AGTTGAAGAC GTGTTCTCAA 3 50 

TTACTGGACG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGACAAGTT 400 

CGCGTTGGTG ACGAAGTTGA AGTTGTTGGT ATTGCTGAAG AAACTTCAAA 45 0 

35 AACAACAGTT ACTGGTGTTG AAATGTTCCG TAAATTGTTA GACTACGCTG 500 

AAGC TGG AG A CAACATTGGT GCTTTACTAC GTGGTGTTGC ACGTGAAGAC 550 

ATCCAACGTG GACAAGTTTT AGCTAAACCA GGTACAATCA CACCTCATAC 600 

AAAATTC TCT GCAGAAGTAT ACGTGTTGAC AAAAGAAGAA GGTGGACGTC 65 0 

ATACTCCATT CTTCACTAAC TACCGTCCTC AATTT 685 

40 



2 ) INFORMATION FOR SEQ ID NO: 65 

45 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
55 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTAYG CCTCAAACAC 50 
60 GTGAACACAT CTTGTTATCA CGTAACGTTG GTGTACCATA CATCGTTGTT 100 

243 
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TTCTTAAACA 
TGAAATGGAA 
ATGTTCCTGT 
TCATACGAAG 
5 TCCAACTCCA 
ACGTATTCTC 
CGTGGACAAG 
AGAAACTGCT 
TAG AC T ATGC 

10 GCTCGTGAAG 
CACACCTCAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 

15 CAATCGCTAT 



AAATGGATAT 
GTTCGTGACT 
AATCGCTGGT 
AAAAAATCAT 
GAACGTGACA 
AATC AC TGG A 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGG 
ACATCCAACG 
ACAAAATTTA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGACGGA 



GGTTGATGAC 
TATTGTCAGA 
TCTGCTTTGA 
GGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GATAACATTG 
TGGACAAGTA 
AAGCTGAAGT 
TTCTTCACTA 
TGGTGTTGTT 
AMGTAACAAT 
ACTCG 



GAAGAATTAC 
ATATGACTTC 
AAGCTC TTG A 
GCTGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGCATTGCT 
TTAGCTAAAG 
TTACGTTTTA 
ACTACCGTCC 
GAATTACCAG 
CGACGTTGAA 



TAGAATTAGT 150 

CCAGGCGACG 2 00 

AGGCGATGCT 2 50 

ACGAATACGT 300 

CCAGTCGAAG 350 

CCGTGTTGAA 400 

GTATTGCTGA 450 

CGTAAATTGT 500 

ACGTGGGGTT 550 

CTGGTACAAT 600 

ACAAAAGAAG 650 

TCAGTTCTAC 7 00 

AAGGAACTGA 750 

TTGATCCACC 800 
825 



2 ) INFORMATION FOR SEQ ID NO : 66 



20 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

{ i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM : Enterococcus galllnarum 

(B) STRAIN: R420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 

3 5 TCCTATGCCT CAAACTCGTG AACACATCTT GTTATCACGT AACGTTGGCG 50 

TACCATACAT CGTTGTTTTC TTGAACAAAA TGGATATGGT TGATGACGAA 100 

G AATTGC TAG AATTAGTTGA AATGGAAGTT CGTGACCTAT TGTCTGAGTA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCCGGTTCT GCTTTGAAAG 200 

CTCTTGAAGG AGATCCTTCA TACGAAGAAA AAATCATGGA ATTGATGGCT 250 

40 GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGATACTG ACAAACCATT 3 00 

CATGATGCCA GTCGAAGACG TATTCTCAAT CACTGGACGT GGT AC TGTTG 350 

CTACAGGCCG TGTTG AACGT GGACAAGTTC GCGTTGGTGA TGAAGTAGAA 400 

ATCGTTGGTA TTGCTGACGA AACTGC TAAA ACAACTGTAA CAGGTGTTGA 450 

AATGTTCCGT AAATTGTTAG ACTATGCTGA AGCAGGGGAT AACATTGGTG 500 

45 CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTG 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTCAAAG CTGAAGTTTA 600 

TGTTTTGACA AAAGAAGAAG GTGGACGTCA CACTCC 636 



50 



60 



2 ) INFORMATION FOR SEQ ID NO: 67 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 835 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

244 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 8043 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACATAT CCTAYTATCT CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCTTGAACA AAGTAGATAT GGTTGACGAC GAAGAATTAC TAGAATTAGT 150 

10 TGAAATGGAA GTTCGTGACT TATTAACAGA ATACGAATTC CCTGGTGACG 2 00 

ATGTTCCTGT AGTTGCTGGT YCAGCTTTGA AAGCTTTAGA AGGCGACGCT 2 50 

TCATACGAAG AAAAAATCCT TGAATTGATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGACA ACGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

15 CGTGGACAAG TTCGCGTTGG TGACGTTGTA GATATCGTTG GTATCGCAGA 450 

AGAAACAGCT CAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGA GACAACATTG GTGCTTTACT ACGTGGTGTT 55 0 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

2 0 AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC AC AATTC TAC 700 

TTCCGTACRA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAACGGT ACTAAATTCT CAATC 835 



25 



40 



2) INFORMATION FOR SEQ ID NO : 68 



( i ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 835 bases 

<B) TYPE: Nucleic acid 
<C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 



CGGAGCAATC TTAGTTGTTT CTGCTGCTGA CGGCCCTATG CCTCAAACTC 50 

GTGAACACAT CCTATTATCT CGTCAAGTTG GTGTACCATA CATCGTTGTA 100 

45 TTCTTGAACA AAGTAGATAT GGTTGATGAC GAAGAATTAC TTGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACGAATTC CCTGGTGACG 200 

ATGTTCCTGT AATCGCTGGT TCAGCTTTAA GAGCTTTAGA AGGCGACGCT 250 

KCATACGAAG AAAAAATTCT TGAATTGATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA GAACGTGATA ACGACAAACC ATTCATGATG CCAGTTGAGG 35 0 

5 0 ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG AC GTGTTG AA 400 

CGTGGACAAG YTCGTGTTGG TGACGTTATC GATATCGTTG GTATCGCAGA 45 0 

AGAAACAGCT CAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GATAACATTG GTGCGTTACT ACGTGGTGTT 55 0 

TCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTAAAC CAGGTACAAT 600 

55 CACACCTCAT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC ACAATTCTAC 7 00 

TTCYGTACGA CTGACGTAAC TRGTGTTGTY GAATTACCAG AAGGAACTGA 7 50 

AATGGTTATG CCTGGCGACA ACGTAACAAT GGAAGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAAATGGT ACTAAATTCT CAATC 83 5 

60 



245 



CA 02307010 2000-05-19 



2) INFORMATION FOR SEQ ID NO : 69 



10 



15 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 



CGGAGCTATC 

20 GTGAACACAT 
TTC TTAAAC A 
TGAAATGGAA 
ACACTCYAGT 
TC AT AC RAAG 

25 CCCAACACCA 
ACGTATTCTC 
CGTGGACAAG 
AGAAACTGCT 
TAGACTACGC 

3 0 GCACGTGAAG 
CACTCCACAT 
AAGGCGGGCG 
TTCCGTACAA 
AATGGTAATG 

3 5 CAATCGCGAT 



TTAGTAGTAT 
CTTGTTATCT 
AAATGGATAT 
GTTCGTGAGT 
TATCGCTGGT 
AAAAAATCTT 
GTTCGTGATA 
AATCACTGGT 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGC 
ACATCCAACG 
-ACAAAATTCT 
TCACACTCCG 
CTGACGTAAC 
CCTGGTGATA 
CGAAGACGGA 



CTGCTGCTGA 

CGTAACGTTG 

GGTTGATGAC 

TATTGTCAGA 

TCAGCYTTGA 

AGAATTAATG 

CTGACAAACC 

CGTGGTACTG 

TGACGAAGTT 

TTACAGGTGT 

GATAACATCG 

TGGACAAGTA 

GTGGAGAAGT- 

TTCTTCACTA 

TGGTGTTGTT 

AC GTAACT AT 

ACTCGTTTCT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGATTTC 
AAGCTTTAGA 
SCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCATTATT 
TTGGCTAAAC 
TTACGTTTTA 
ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 
CTATTC 



CCTCAAACAC 50 

CATCGYTGTA 100 

TAGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACCCT 25 0 

ACGAATACAT 300 

CCAGTCGAAG 3 50 

TCGTGTTGAA 400 

GTATCGCTGA 450 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGCTTCAAT 600 

ACTAAAGAAG 650 

TCAGTTCTAC 7 00 

AAGGTACTGA 7 50 

TTAATCCACC 800 
836 



40 



2 ) INFORMATION FOR SEQ ID NO : 70 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus rafflnosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 0 

55 CGGAGCTRTC TTAGTAGTAT CTGCTGCTGA TGGCCCTATG CCTCAAACTC 5 0 

GTGAACACAT CTTGTTATCT CGTAACGTTG GTGTTCCTTA CATCGTTGTA 100 

TTC TTAAAC A AAATGGATAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAACTGA ATACGAC TTC CCAGGCGACG 200 

ACACTCCAGT TATCGCAGGT TCAGCTTTGA AAGC CTTAG A AGGCGACGCT 250 

60 TCATACGAAG AAAAAATCTT AGAATTAATG GCTGCTGTTG ATGAATACAT 3 00 

246 
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CCCAACACCA GTTCGTGATA 
ACGTAYTCTC AATCACTGGT 
CGTGGACAAG TTCGCGTTGG 
AGAAACTGCT AAAACAACTG 
5 TGGATTACGC TGAAGCGGGC 
GCACGTGAAG ACATCCAACG 
CACTCCACAT ACAAAATTCT 
AAGGCGGACG TCATACTCCA 
TTCCGTACAA CTGACGTAAC 
10 AATGGTAATG CCTGGTGATA 
CAATCGCGAT CGAAGACGGA 



CTGACAAACC ATTCATGATG 
CGTGGAACTG TTGCAACTGG 
TGACGAAGTT GAAATCGTAG 
TTACAGGTGT TGAAATGTTC 
GACAACATTG GTGCATTATT 
TGGACAAGTA TTGGCTAAAC 
CTGCAGAAGT TTACGTTTTA 
TTC TTC ACTA ACTACCGTCC 
TGGTGTAGTT GATCTACCAG 
ACGTAACTAT GGAAGTTGAA 
ACTCGTTTCT CTATT 



CCAGYGGAAG 3 5 0 

TCGTGTTGAA 400 

GTATTGCTGA 45 0 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGCTTCAAT 600 

ACTAAAGAAG 650 

TCAGTTCTAC 700 

AAGGTACTGA 7 50 

TTAATCCACC 800 
835 



15 2 ) INFORMATION FOR SEQ ID NO : 71 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43 07 6 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 



CGGCGCGATC TTAGTAGTAT CTGCTGCTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCT CGTAACGTAG GTGTTC CTT A CATCGTTGTA 100 

TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCAGA ATACGATTTC CCAGGCGATG 2 00 

35 ACACTCCAGT TATTGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGATC C A 2 50 

GTTTACGAAG AAAAAATCTT CGAATTAATG GCTGCAGTTG ACGAATATAT 3 00 

CCCAACTCCA G AAC GTGATA CTGAAAAACC ATTCATGATG CCAGTTGAGG 3 50 

ATGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG TCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGACGTTGTA G AAATC GTTG GTATCGACGA 45 0 

40 AG AAAC AGCT CAAACTACTG TAACAGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GATAACATCG GTGC TTTATT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTAGCTAAAC CAGGAACAAT 600 

CACTCCTCAT ACAAAATTCG TAGCTGAAGT TTACGTTTTA ACTAAAGAAG 65 0 

AAGGTGG AC G TCATACTCCA TTC TTC ACTA ACTACCGTCC TCAATTCTAC 7 00 

45 TTCCGTACAA CTGACGTAAC TGGTGTTGTA GAATTACGCG AAGGTACTGA 7 50 

AATGGTAATG C C TGGTGAC A ACGTAACTAT CGACGTTGAA TTAATCCACC 800 

CAATCGCTAT CGAAGACGGA ACTCGTTTCT CTATT 835 



50 

2 ) INFORMATION FOR SEQ ID NO: 72 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 49428 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

GAGCTATCTT GGTAGTTTCT GCAGCTGATG GCCCAATGCC ACAAACTCGT 50 

GAACATATTT TGTTGTCACG TAATGTAGGT GTACCTTACA TCGTTGTGTT 100 

CTTGAACAAA ATGGATATGG TTGATGACGA AGAATTACTT GAGTTAGTTG 15 0 

10 AAATGGAAGT ACGTGATCTA TTATCTGAAT ACGACTTCCC AGGAGATGAT 200 

ACTCCAGTTA TTTCCGGTTC AGCTTTGAAA GCTTTAGAAG GCG AC GAAG A 25 0 

ATATGAACAA AAAATTATGG ACTTAATGGA TGCAGTTGAT GACTACATTC 3 00 

CAACTCCTGA ACGTGACCAT GACAAACCAT TCATGATGCC AATTGAAGAT 35 0 

GTATTTTCAA TTACAGGCCG TGGTACTGTT GCTACAGGAC GTGTTGAACG 400 

15 CGGGACTATC AAAGTCGGCG ATGAAGTTGA CATTATTGGT ATTCATGAAG 450 

ACGTTAAAAA GACAACAGTT ACTGGTGTAG AAATGTTCCG TAAATTGTTG 500 

GACTACGCTG AAGCAGGCGA TAACATTGGT AC TTTGTT AC GTGGTGTTTC 550 

TCGTGATGAT ATCGAACGTG GTCAAGTATT AGCTAAACCA GGTTCAATCA 600 

CACCACATAC AAGATTCTCT GCTGAAGTTT ATGTTTTGAC TAAAGAAGAA 650 

2 0 GGCGGACGTC ATACTCCATT CTTCTCAAAC TATCGTCCTC AATTCTACTT 700 

CCGTACAACT GATATCACTG GTGTCATTGA ATTGCCAGAA GGTACTGAAA 750 
TGGTAATGCC AGGTGATAAT GTAACAATGG ATGTTGAATT AATCCACCCA 800 
GTCGCTATCG AAGAAGGAAC TCG 823 

25 

2 ) INFORMATION FOR SEQ ID NO : 7 3 

( i ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 83 5 bases 

(B) _~~TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 3 

CGGTGCAATC TTGGTCGTAT CAGCGACAGA TGGCCCAATG CCACAAACAC 50 

GGGAGCATAT TTTGCTTTCT CGTCAAGTGG GTGTGAAACA TTTGATCGTC 100 

45 TTTTTGAATA AGACGGACCT TGTCGATGAT GACGAGTTGA TCGATTTAGT 150 

TGAAATGGAA GTCAGAGAAT TGCTGACTGA ATATGATTTT CCTGGCGACG 2 00 

ACATTCCTGT GATCAAGGGC TCTGCGTTAA AAGCCTTGGA AGGGGACCCA 250 

GATGCTGAAG CAGCGATCTT AACGCTGATG GATACAGTAG ATGAATATAT 300 

CCCAACGCCA GAACGTGATA CTGACAAACC ATTGTTGTTA CCGATCGAAG 350 

50 ATGTCTTTTC GATCACAGGA CGGGGG ACCG TTGCTTCTGG TCGGATCGAT 400 

CGCGGCATGG TAAAAGTCGG GGATGAAGTA GAAATCGTCG GAATCAAACC 450 

TGAAACACAA AAAGCAGTCG TGACAGGGGT AGAAATGTTC CGCAAAACGA 500 

TGGACTTCGG AGAAGC TGGC GATAACGTAG GGGTATTGTT ACGGGGCATC 550 

ACCCGTGATG AAATTGAACG TGGCCAAGTG TTAGCAAAAC CAGGTTCTAT 600 

55 CACACCGCAT ACGAAATTCC AAGCGGAAGT CTATGTGTTG ACAAAAGAAG 650 

AAGGCGGTCG CCATACCCCA TTCTTTAATA ATTATCGCCC ACAATTTTAC 700 

TTCCGTACAA CGGACGTAAC TGGGAATATC GTTTTACCAG AAGGAACGGA 750 

AATGGTGATG CCTGGTGACA ACGTAACGAT CGATGTGGAA TTG ATC CATC 800 

CGATCGCTGT AGAAAATGGA AC G AC CTTCT CGATT 835 

60 
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10 



2) INFORMATION FOR SEQ ID NO: 74 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 
15 (B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 

TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50 

20 AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100 

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150 

AGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACTTGCTT 200 

TTTTAAGTAT AGGTATAAAA TACGATTGAT TAAAACAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300 

25 TACAACAACG CTGATTGGCG CGTT AC T ATT ATATTTGCCA ATCACAGGTA 3 50 

AACATCCTAT TGATTTTGTG GACGCCCGTT 3 80 



30.. 2 ) INFORMATION FOR SEQ ID NO: 75. 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 666 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: ATCC 49996 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 



GCAATCTTGG TCGTATCAGC GACAGATGGC CCAATGCCAC AAACACGGGA 5 0 

GCATATTTTG CTTTCTCGTC AAGTGGGTGT GAAACATTTG ATCGTCTTTT 100 

TGAATAAGAC GGACCTTGTC GATGATGACG AGTTGATCGA TTTAGTTGAA 150 

ATGGAAGTCA GAG AATTGC T GACTGAATAT GATTTTCCTG GCGACGACAT 200 

50 TCCTGTGATC AAGGGCTCTG CGTTAAAAGC CTTGGAAGGG GACCCAGATG 250 

CTGAAGCAGC GATCTTAACG C TGATGG AT A CGGTAGATGA ATATATCCCA 300 

ACGCCAGAAC GTGATAC TG A CAAACCATTG TTGTTACCGA TCGAAGATGT 350 

CTTTTCGATC ACAGGACGGG GGACCGTTGC TTCTGGTCGG ATCGATCGCG 400 

GCATGGTAAA AGTCGGGGAT GAAGTAGAAA TCGTCGGAAT CAAACCTGAA 450 

55 ACACAAAAAG C AGTCG TG AC AGGGGTAGAA ATGTTCCGCA AAACGATGGA 500 

CTTCGGAGAA GCTGGCGATA ACGTAGGGGT ATTGTTACGG GGCATCACCC 550 

GTGATGAAAT TGAACGTGGC CAAGTGTTAG CAAAACCAGG TTCTATCACA 600 

CCGCATACGA AATTCCAAGC GGAAGTCTAT GTGTTGACAA AAGAAGAAGG 6 50 

CGGTCGCCAT ACCCCA 666 



249 
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2 ) INFORMATION FOR SEQ ID NO : 7 6 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 51 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
15 (B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 6 

TGGTGCGATT TTAGTTGTAT CCGCAACAGA TGGTCCAATG CCTCAAACCC 50 

20 GGGftftCftMCT CTTGCTTTCG AGACAAGTTG GTGTGAAACA TCTGATTGTT 10 0 

TTCTTGAACA AAATCGATTT AGTCGATGAC GAAGAATTGA TTGATTTAGT 15 0 

AGAAATGGAA GTAAGAGAAC TGCTATCTGA ATATAATTTT CCAGGGGATG 2 00 

ACATTCCTGT TATCAAAGGT TCGGCATTAA AAGCGTTGGA AGGAGACCCT 2 50 

GATGCAGAAG CTGCCATCAT GGAATTAATG GATACAGTAG ACAGCTATAT 3 00 

25 CCCAACACCT GAGCGTGATA CAGACAAACC ATTACTCTTG CCAGTTGAAG 3 50 

ATGTC TTTTC G ATT AC TGG A CGAGGAACAG TTGCTTCCGG ACGGATCGAT 400 

CGGGGAACAG TTCGGGTAGG CGATGAAGTA GAAATCGTCG GTATCAAACC 45 0 

TGAAACCCAA AAAGCTGTAG TGACAGGCGT CGAAATGTTC CGCAAGACGA 50 0 

TGGACTTTGG GGAAGCCGGT GACAATGTAG GTGTCTTGCT GAGAGGGATC 550 

30 ACTCGTGACG AAATTGAACG AGGACAAGTG TTGGCTAAAC C AGGTTC GAT 600 

GACACCACAT ACAAAATTCC AAGCAGAAGT TTATGTATTG ACGAAAGAAG 650 

AAGGTGGTCG TCATACACCA TTCTTCAACA ACTATCGTCC ACAATTTTAT 700 

TTCCGTACAA CGGATGTGAC AGGGAACATT ACATTGCCTG AAGGAACAGA 750 

A 751 

35 



2 ) INFORMATION FOR SEQ ID NO : 77 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia canis 
50 (B) STRAIN: Florida 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 



TGCAGCAATA 
55 GAGAACATAT 
TGGATGAATA 
TGAAATGGAA 
ATATAGATGT 
GGCTCAGGTG 
60 AAAAATAAGT 



TTAGTAGTGT 
ATTATTAGCA 
AGTGTGATGT 
ATAAGGGAAT 
AGTTAGAGGA 
TGTGGAGTGA 
TTACCAGTAA 



CTGCAACTGA TGGAGCAATG 
AAGCAAGTAG GTGTAAAAGA 
TGTAGATGAT GAAGAAATGT 
TGTTATCAAA ATATGGGTAT 
TCTGCAGTTA AAGCATTAGA 
AAAAATAATG GAATTGATGA 
GAGAAAAAGA TAAGCCATTT 

250 



CCACAAACAA 50 

TATAGTAGTG 100 

TGTCATTAGT 150 

CCTGGGGATG 200 

AGAAGAAACA 250 

ATGC TTTAG A 3 00 

TTAATGTCAA 3 50 
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TAGAAGATGT GTTTTCAATA 
ATAGAAAGAG GAGTAATTAG 
GCGTGAGATA CAAAGTACAG 
CATTAGATGC AGGAGAAGCA 
5 ATAAAAAAAG AAGATGTAGA 
G AT AC ATTC A TATAAGAGAT 
AAGAAGGAGG AAGACATACT 
TATGTTAGAA CAACAGATGT 
AGAAATGGTA ATGCCAGGGG 
10 AGCCTGTTGC TATTGATCAA 



CCTGGAAGAG GTACAGTAGT 
AGTAGGGGAT AAAATAGAGA 
TATGTACAGG TGTTGAAATG 
GGGGATAATG CTGGAATATT 
AAGAGGGCAA GTATTGAGTG 
TTAAGGCAGA GGTATATATA 
CCATTTTTCT CAAATTACCA 
AACAGGGAAT ATAAAGTTAC 
ATAATATAAA TATCGAAGTG 
GGATTGAGAT TTGC 



AACAGGAAGA 4 00 

TAGTAGGATT 450 

TTTCATAAAG 500 

GTTAAGAGGG 5 50 

CACCTGGACA 600 

TTGAAAAAAG 650 

GCCGCAATTT 700 

CAGAAGGAGT 750 

AGTTTGGATA 800 
834 



2 ) INFORMATION FOR SEQ ID NO: 7 8 



15 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

3 0 CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

3 5 GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTAYAT 3 00 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTC CTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

40 ACGAAGGCCG TGC TGGTG AG AACGTAGGTG TTCTGCTGCG TGGT ATC AAA 550 

C GTG AAG AAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 6 50 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GC GTAG AG AT 7 50 

45 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGT 817 



50 2 ) INFORMATION FOR SEQ ID NO : 79 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 ( vi ) ORIGINAL SOURCE: 

251 
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(A) ORGANISM: Escherichia fergusonii 

(B) STRAIN: ATCC 35469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 

5 CGATCCTGGT AGTTGCTGCG ACTGACGGCC CGATGCCGCA 
CACATCCTGC TGGGTCGTCA GGTAGGCGTT CCGTACATCA 
GAACAAGTGC GACATGGTTG ATGACGAAGA GCTGCTGGAA 
TGGAAGTTCG TGAACTTCTG TCTCAGTACG ACTTCCCGGG 

10 CCGATCGTTC GTGGTTCTGC TCTGAAAGCG CTGGAAGGCG 
GGAAGCGAAA ATCCTGGAAC TGGCTGGCTT CCTGGATTCT 
AACCAGAGCG TGCGATTGAC AAGCCGTTCC TGCTGCCGAT 
TTCTCCATCT CCGGTCGTGG TACCGTTGTT ACCGGTCGTG 
TATCATCAAA GTTGGTGAAG AAGTTGAAAT CGTTGGTATC 

15 AG AAGTC TAC CTGTACTGGC GTTGAAATGT TCCGCAAACT 
GGCCGTGCTG GTGAGAACGT AGGTGTTCTG CTGCGTGGTA 
AGAAATCGAA CGTGGTCAGG TACTGGCTAA GCCGGGCACC 
AC AC C AAGTT CGAATCTGAA GTGTACATTC TGTCCAAAGA 
CGTCATACTC CGTTCTTCAA AGGCTACCGT CCGCAGTTCT 

20 TACTGACGTG ACTGGTACCA TCGAACTGCC GGAAGGCGTA 
TGCCGGGCGA CAACATCAAA ATGGTTGTTA CCCTGATCCA 
ATGGACGACG GTCTGCGTTT CGCAA 



GAC TCGTG AG 50 

TCGTGTTCCT 100 

CTGGTTGAAA 150 

CGACGACACT 2 00 

ACGCAGAGTG 250 

TACATTCCGG 3 00 

CGAAGACGTG 350 

TAGAACGCGG 400 

AAAGAGACTC 450 

GCTGGACGAA 500 

TCAAACGTGA 550 

ATCAAGCCGC 600 

TGAAGGCGGT 65 0 

ACTTCCGTAC 700 

G AGATGGTAA 7 5 0 

CCCGATCGCG 8 0 0 
825 



25 



35 



60 



2 ) INFORMATION FOR SEQ ID NO: 80 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 829 bases 
3 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GG AACTGGTT 150 

45 GAGATGGAAG TTCGCGAACT GCTGTCCCAG TACGATTTCC CGGGCGACGA 200 

CACCCCGATC GTTCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 

AGTGGGAAGA GAAAATCATC GAACTGGCTG GCTACCTGGA TTCCTATATC 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 350 

CGTATTCTCC ATCTCCGGCC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

5 0 GCGGTATCAT CAAAGTGGGT GAAGAAGTTG AAATCGTGGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG TTCCATCAAG 600 

CCKCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTC C A AAGACGAAGG 650 

5 5 CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTTGAGATG 7 50 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 



252 
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2) INFORMATION FOR SEQ ID NO: 81 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 



15 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

2 0 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGTGAAC TTCTGTCCCA GTACGACTTC CCGGGCGACG 2 00 

ACACCCCGAT CATTCGTGGT TCTGCGCTGA AAGCGCTGGA AGGCGAAGCT 2 50 

GAGTGGGAAG AGAAAATCGT TGAGCTGGCT GGCTACCTGG ATTCCTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

2 5 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAARGTKGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

TACTGCGAAA TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCG TGCAGGCGAG AACTGCGGCG TTCTGCTGCG TGGTATCAAG 550 

CGTGAAGAGA TCCAGCGTGG CCAGGTTCTG GCTAAGCCGG GCTCAATCAA 600 

3 0 GCCGCACACC AAGTTCGAAT CCGAAGTGTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 7 50 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCATCCGA 800 

TCGCGATGGA CGACGG 816 

35 

2) INFORMATION FOR SEQ ID NO : 82 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacteri um lentum 
50 (B) STRAIN: ATCC 43055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 

CGGCGCCTCC TCGTTATCGC CGCCACCGAC GGCCCGATGG CCCAGACCCG 50 

55 CGAGCACATC CTGCTCGCCC GTCAGGTCGG CGTGCCCTAC ATCGTGGTCT 100 

TCCTGAACAA GTGCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAAG TTCGCGAGCT GCTCGACTCT TACGAGTTCC CGGGCGACGA 2 00 

CACCCCGATC ATCCGCGGCT CCGCTTTGAA GGCCCTCGAG GGCGACAAAG 250 

AGTGGCAGGA GAAGGTCTGG GAGCTCATGG ACGCCGTCGA CTCCTACATC 3 00 

60 CCGACGCCGG AGCGCATGGT CGACAAGCCG TTCCTGATGG CCGTCGAGGA 3 50 

253 
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CACGATGACC ATCACCGGCC GCGGCACCGT 
GTGGTACGCT GCATGTCAAC GACCCGCTGG 
ACCCAGAACA CGGTC TGC AC CGGTATCGAG 
CGAGGCTCAG GCCGGCGACA ACATCGGCTG 
5 GCGAGGAGAT CGTTCGCGGC CAGGTTCTCT 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGCCAC ACGCCGTTCT TCGACGGCTA 
GCACGACGGA CGTGACGGGT GTTGCCCACC 
GTCATGCCGG GCGACAACGT GGAGATCAAG 
10 CGCCAGGAAG AGGGCTGCGT TCGCTAAC 



TGCCACCGGT CGTGTGGAGC 4 00 

AGATCGTCGG TATCAAGGAG 45 0 

ATGTTCCGCA AGCTGCTCGA 500 

CCTGCTCCGC GGTGTCAAGC 550 

GCAAGCCCGG TAGCGTGACC 600 

ATCCTGACGA AGGAAGAGGG 65 0 

CCGTCCGCAG TTCTACTTCC 700 

TTCCCGAGGG CACCGAGATG 750 

GGCGAGCTCA TTCACCCGAT 800 

828 



2 ) INFORMATION FOR SEQ ID NO: 83 



15 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Eubac terium nodatum 

(B) STRAIN: ATCC 33099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

3 0 GGAGCAATTC TGGTTTGTGC AGC AAC KG AC GGACCAATGC CTCAGACAAG 5 0 

AGAACATATC _CTjI^TGTCAA_jGGCAGGTAGG~AGTGCCATAT "ATCATCGTAT 100 

TCCTGAATAA ATGTGACATG GTGGATGAYG AAGAGCTTCT GGACTTGGTA 15 0 

GAGATGGAAG TAAGAGAACT TCTCAGTGAG TATGAATTCC CGGGAGATGA 2 00 

TACCCCGATA GTAAGAGGTT CAGCCCTGAA GGCACTGGAA GAACCCAATG 250 

35 GAGAATGGGC AGACAAGATT GTAGAGCTGA TGGAGGAAGT AGATAAATAC 3 00 

ATTCCTGAAC CAAAGAGAGA TAACGACAAA CCGTTCCTGA TGCCTGTAGA 350 

GGACGTATTC TCAATAACAG GAAGAGGAAC AGTAGCGACA GGAAGRGTTG 400 

AAAGAGGAAT CCTGAAGGTC GGTGATGAAG TAGAAATCGT GGGAATGAGC 450 

GAAGAGAGAA GAAAGGTAGT AGTAACGGGA GTTGAAATGT TCAGAAAGCT 500 

40 TCTGGATGAA GCAGAGACAG GAGACAACAT CGGAGCACTG CTGAGAGGAG 550 

TTCAGAGAAC RGAGATCCAG AGAGGTCAGG TATTGGCRGC ACCTGGAACG 600 

ATCAACCCAC ATACAAAGTT CAAGGGTCAG GTATATGTAC TGAAGAAGGA 650 

AGAAGGAGGA AGGCATACGC CGTTCTTCAA YGGATACAGW CCACAGTTCT 7 00 

ACTTCAGAAC AAC AG AC GT A ACAGGAGATT TGCAGCTGCC GGAAGGARCA 7 50 

45 GAGATGTGCA TGCCGGGAGA TAATGTGGTA ATGAACRTCA GCCTGATCAC 800 

TCCGATTGCT ATAGAAGAGG GWCTGAGATT TGCCA 835 



50 2 ) INFORMATION FOR SEQ ID NO: 84 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 (vi) ORIGINAL SOURCE: 

254 
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(A) ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

GCGATCCTGG TTGTTGCTGC AACTGATGGC CCTATGCCAC AGACTCGTGA 50 

GCACATCCTG TTGGGTCGYC AGGTTGGCGT TCCATTCATG ATCGTATTCA 100 

TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTAGAA 15 0 

ATGGAAGTTC GYGAACTTCT GTCTGCTTAC GATTTCCCAG GCGACGACAT 2 00 

10 CCCAGTTGTT AAAGGTTCAG CGCTGAAAGC ACTGGAAGGC GAAGCAGAGT 2 50 

GGGAAGCTAA GATCATCGAG CTGGCTGGCC AC C TGG AT AG CTACATCCCA 3 00 

GAACCAGAGC GTGCGATTGA CAAGCCATTC CTGCTGCCAA TCGAAGACGT 3 50 

ATTCTC CATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAGCGCG 400 

GTATCATCAA AGTTGGCGAA GAAGTTGAAA TCGTTGGTAT CAAGGACACT 45 0 

15 GTTAAGTCTA C TTGTAC TGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCY GGTGAGAACG TTGGTGTTCT GCTGCGTGGT ATCAAGCGTG 5 50 

AAGACATCGA ACGTGGTCAG GTTCTGGCTA AACCAGGTTC AATCAAACCA 600 

CACACCACWT TYGACTCAGA AGTTTATATC CTGAGCAAAG ATGAAGGCGG 650 

CCGTCATACT CCGTTCTTCA AAGGCTACCG TCCACAGTTC TACTTCCGTA 7 00 

2 0 CAACTGACGT GACCGGTACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 7 50 

ATGCCAGGTG ACAACATCAA CATGRTAGTT ACCCTGATCC ACCCAATCGC 800 

GATGGATGAC GGTCTGCGTT TCGCAA 82 6 



25 



35 



60 



2) INFORMATION FOR SEQ ID NO: 85 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 8 bases 
30- (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:. 

(A) ORGANISM: Francisella tularensls 

(B) STRAIN: LVS 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 85 

TGGTGCT ATT CTAGTATGTT CTGCTGCGGA TGGTCCTATG CCACAAACTC 5 0 

GTGAGCACAT TCTGCTTTCT CGTCAAGTTG GTGTACCAAA AATCGTTGTT 10 0 

TTCTTAAACA AGTGTGACAT GGTTGATGAT GAAGAGTTAT TAGAGCTAGT 150 

45 TGAGATGGAA GTTCGTGAGC TTTTAGATCA GTATGAGTTC CCAGGTGATG 200 

ACACTCCAGT TATTATGGGT TCAGCTCTTA GAGCTATTGA AGGTGACGAA 250 

GCTTACGTTG AGAAAATTGT TGAGCTAGTT CAAGCTATGG ATG AC TAT AT 3 00 

TCCTGCTCCT GAGCGTGATA CTGAGAAGCC ATTTATTCTT CCGATCGAAG 350 

ATGTATTCTC AATTTCAGGT CGTGGTACTG TTGTAACTGG TCGTATTGAG 400 

5 0 CGCGGTGTAG TTAACGTTGG TGATGAAGTT GAAGTTGTTG GTATTCGTCC 450 

AACTCAAAAA ACTACAGTAA CTGGTGTGGA AATGTTCCGT AAGCTTTTAG 500 

ATAGAGGGGA AGCTGGTGAT AACGTTGGTA TCCTAGTTCG TGGACTTAAG 550 

AGAGATGATG TTGAGCGTGG ACAAGTATTA TGTAAGCCAG GTTCAATTAA 600 

GCCACATACT AAGTTTGAAG C TGAGGTTT A TGTATTATCT AAAGAAGAGG 65 0 

55 GTGGTAGACA TACTCCATTC TTCAAGGGAT ATAGACCACA ATTCTACTTC 700 

CGTACTACAG AC ATTAC TGG AGCTGTTGAG CTTCCAGAGG GTGTAGAAAT 750 

GGTTATGC C T GGTGATAACG TTAAGATGAC TATCACTCTA ATT AAC C C AA 800 

TCGCTAGGAT GAAGGGTTAC GTTTTGCA 82 8 
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2 ) INFORMATION FOR SEQ ID NO : 86 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphism 

(B) STRAIN: ATCC 10953 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 



CGGTGCTATC ATCGTAGKTG CTGCTACTGA TGGTCCGATG CCTCARACTC 5 0 

GTGAGCAYAT CYTGCTGGCT CGTCAGGTAA ACGTWCCKAG ACTGGTTGTA 100 

2 0 TTCATGAACA AGTGYGACAT GGTAGACGAC GCTGAAATGY TGGAACTCGT 15 0 

TGAAATGGAA ATGCGTGAAC TGCTTTCAGC YTACGAATTC GAYGGYGACA 2 00 

ACACTCCKTT CATTCAGGGT TCTGCTCTTG GTGCRTTGAA YGGCGTTGAA 2 50 

AAGTGGGAAG AGAAGGTTAT GGANCTGATG GATGCTTGCG AC AC TTGG AT 3 00 

TCCTTTGCCT CCGCGTGATA TTGAYAAACC GTTCTTGATG CCGGTTGAAG 3 50 

2 5 ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCTACTGG TCGTATCGAA 400 

GCTGGTGTTA TCCATGTAGG TGACGAAGTT GAAATCCTCG GTTTGGGTGA 450 

AGACAAGAAG TCTGTTGTAA CTGGTGTTGA AATGTTCCGC AAGTTGCTGG 500 

ATCAAGGTGA AGCTGGTGAC AACGTAGGTY TGTTGCTCCG TGGTATCGAC 550 

AAG AAC G AAA TCAAACGTGG TATGGTTCTT TGTAAGCCCG GTCAGATTAA 600 

3 0 ACCTCACTCT AAGTTCAAAG CTTCTATCTA CGTTTTGAAG AAAGAAGAAG 650 

GTGGTCGTCA CACTCCGTTC CACAACAAAT ACCGTCCTCA GTTCTATCTG 7 00 

CGTACTATGG ACTGTACAGG TGAAATCWCT CTTCCGGAAG GAACTGAAAT 7 50 

GGTAATGCCT GGTGATAACG TAGAAATCAC TGTAGAACTG ATCTACCCGG 800 

TAGCATTGAA CGTAGGTTTG CGTTTCGCT 829 

35 



2 ) INFORMATION FOR SEQ ID NO : 87 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella haemolysans 
50 (B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 

CTATCTTAGT AATCGCTGCT ACAGATGGAC CAATGGCTCA AACTCGTGAG 50 

5 5 CACATCCTAT TATCTCGTAA CGTTGGAGTA CCAAAAATCG TTGTATTCTT 100 

AAACAAATGT GATATGGTTG ATGACGAAGA GTTATTAGAA TTAG TTG AAA 150 

TGGAAGTTCG TGAACTATTA TCTGAATACG GATTCGACGG AGATGAACTA 2 00 

CCAGTAATCA AAGGTTCTGC TCTTAAAGCT CTTGAAGGAG ATGCAGATGC 250 

AG AAAAAGC T ATCATCGAAT TAATGGAAAC AGTTGACGAA TACATCCCAA 3 00 

60 CTCCAGAACG TGATAACGCT AAACCATTCA TGATGC C AG T TGAGGACGTA 350 
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TTCTCAATCA CAGGTCGTGG 
ACAAGTTAAA GTTGGAGACG 
CAGCTTCAAC TACTGTAACA 
TACGCTGAAG CAGGAGATAA 
5 TGAAGACATC GAACGTGGAC 
CACACACTCA ATTCGTAGCT 
GGACGTCACA CTCCATTCTT 
TACTACTGAC GTAACTGGTG 
TAATGC CTGG GGATAACGTA 
10 GCGATCGAAG AAGGAACTCG 



TACAGTTGCT ACTGGACGTG 
TAGTAGAAAT CGTTGGATTA 
GGTGTTGAAA TGTTCCGTAA 
CATCGGTGCA TTATTACGTG 
AAGTTTTAGC AGCTCCTAAA 
GACGTGTACG TATTATCTAA 
C AC AAAC T AC CGTCCTCAAT 
TAGTTACTTT ACCAGAAGGT 
TCAATCAACG TAGAACTTAT 
TTTCTCAA 



TTGAACGTGG 400 

ACTGAAGAAC 4 50 

ATTATTAGAT 500 

GTGTTGCTCG 55 0 

ACAATCACTC 600 

AGAAGAAGGT 6 50 

TCTACTTCCG 700 

ACTGAAATGG 7 50 

TTCTCCAATC 8 00 
828 



2) INFORMATION FOR SEQ ID NO: 88 



15 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 

3 0 . TCTTAGTAAT CGCTGCTACA GATGGTCCTA TGGCTCAAAC TCGTGAACAC 50 

ATCCTATTAT CTCGTAACGT TGGAGTACCT AAAATTGTTG T ATTC TT AAA 100 

CAAATGTGAT ATGGTTGATG ACGAAGAGTT ATTAGAATTA GTAGAAATGG 15 0 

AAGTTCGTGA AC TATTATCT GAATACGGAT TTGATGGAGA TGAACTACCA 2 00 

GTAATCAAAG GTTCAGCTCT TAAAGCTCTT GAAGGAGATG CAGATGCTGA 250 

3 5 AAAAGCTATC ATCGAATTAA TGGAAACAGT TGACGAGTAC ATCCCAACTC 3 00 

CAGAACGTGA TAACGCTAAA CCATTTATGA TGCCAGTTGA GG AC GTGTTC 35 0 

TCAATCACAG GTCGTGGTAC AGTTGC T AC T GGACGTGTTG AACGTGGACA 400 

AGTTAAAGTT GGTGACGTAG TAGAAATCGT TGGATTAACT GAAGAACCAG 450 

CTTCAACTAC TGTAACAGGT GTTGAAATGT TCCGTAAATT ATTAGATTAC 500 

40 GCTGAAGCAG GAGATAACAT CGGTGCATTA TTACGTGGTG TTGCTCGTGA 5 50 

AGATATCGAA CGTGGACAAG TTTTAGCAGC TCCTAAAACA ATCACTCCAC 600 

ATACTCAATT CGTAGCTGAT GTGTACGTAT TATCTAAAGA AGAAGGTGGA 65 0 

CGTCACACTC CATTCTTCAC AAACTACCGT CCACAATTCT ACTTCCGTAC 7 00 

TACTGACGTA ACTGGTGTAG TTACTTTACC AGAAGGTACT GAAATGGTAA 7 50 

45 TGCCTGGGGA CAACGTATCA ATCAACGTAG AACTTATTTC TCCAATCGCT 800 

ATCGAAGAAG GAACTCGTTT CTC 823 



50 2 ) INFORMATION FOR SEQ ID NO : 89 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Haemophilus act inomycetemcomi tans 

(B) STRAIN: ATCC 33384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 

5 

GCTATCTTAG TAGTAGCAGC AACCGACGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCGG GCGATGACAC 200 

10 CCCAATCGTA CGCGGTTCTG CAT TAAAAGC GCTTGAAGGC GATGCCGCAT 250 

GGGAAGAAAA AATCCTTGAA TTAGCAAACC ATTTAGATAC TTACATCCCG 3 00 

GAACCTGAGC GTGCTATCGA CCAACCGTTC CTTCTTCCAA TTGAAGATGT 350 

GTTCTCTATC TCCGGTCGTG GTACCGTAGT AACGGGTCGT GTTGAGCGCG 400 

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTGGGTAT CAAACCGACT 450 

15 GCAAAAACCA CCGTAACCGG TGTTGAAATG TTCCGTAAAT T ACTTGAC G A 500 

AGGTCGTGCG GGTGAAAACA TCGGTGCATT ATTGCGTGGT ACTAAACGTG 5 50 

AAGAAATCGA ACGTGGTCAG GTATTGGCGA AACCGGGGTC AATCACCCCG 600 

CACACTGACT TC GAATCTG A AGTGTACGTA TTGTCCAAAG AAGAAGGTGG 650 

TCGTCATACT C C ATTCTTC A AAGGTT AC C G TCCACAATTC TATTTCCGTA 7 00 

2 0 CAACTGACGT AACCGGTACT ATCGAGTTAC CTGAAGGCGT GGAAATGGTT 7 50 

ATGCCTGGCG ATAACATCAA AATG AC CGT A TCCTTAATTC ACCCAATTGC 800 

GATGGACCAA GGTTTACGTT TCGCTATCG 82 9 



25 



35 



60 



2 ) INFORMATION FOR SEQ ID NO : 90 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus aphrophilus 

(B) STRAIN: ATCC 33389 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

45 TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGATG 200 

AT AC AC C AAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 250 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTAG ATACTTACAT 300 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCC TTCTT CCAATTGAAG 350 

ACGTGTTCTC TATCTCCGGT CGTGGTACTG TAGTAACAGG TCGTGTTGAG 400 

50 CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 45 0 

GACTGCGAAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 5 00 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGCACTAAA 550 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GCTCAATCAC 600 

TCCGCACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

55 GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTC TATTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GCGTGGAAAT 750 

GGTTATGCCT GGCGATAACA TCAAAATGAC TGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA C C AAGGTTT A CGTTTCGCTA TCG 833 



258 
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2) INFORMATION FOR SEQ ID NO: 91 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 



CGGCGCTATC TTAGTTGTAG CAGCAACTGA TGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTATTAGGC CGCCAAGTTG GTGTTCCTTA CATCATCGTA 100 

20 TTCTTAAATA AATGCGATAT GGTAGATGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGATTTC CCAGGTGACG 200 

ATACTCCTAT CGTTCGTGGT TCAGCATTAC AAGCATTAAA TGGTGTGCCT 250 

GAGTGGGAAG AAAAAATCAT TGAATTAGCA CAACACTTAG ATTCTTATAT 3 00 

CCCTGAGCCT GAGCGTGCGA TTGATAAACC TTTCTTATTA CCAATCGAAG 350 

25 ACGTATTCTC AATTTCAGGT CGTGGTACAG TAGTAACCGG TCGTGTTGAG 400 

CGTGGTATCA TCAAATCAGG TGAAGAAGTT GAAATCGTAG GGATTAAAGA 450 

AACGACAAAA ACAACAGTAA C CGGTGTTG A GATGTTCCGT AAAC T ATT AG 500 

ACGAAGGTCG TGCGGGTGAA AACGTAGGTG CCTTATTACG TGGTACTAAA 55 0 

CGTGAAGAAA TCGAACGTGG TCAAGTATTA GCGAAACCAG GTACAATTAC 600 

3 0 ACCACACACT GATTTTGAAT CAGAAGTTTA TGTATTATCA AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGTT ATCGTCCTCA GTTCTACTTC 7 00 

CGYACAACGG ACGTAACAGG AACGATTGAA TTACCTGAAG ATGTTGAGAT 750 

GGTAATGCCT GGTGATAATA TCAAGATGAC AGTAAGCTTA ATTCACCCTA 800 

TCGCGATGGA CGAAG 815 

35 

2 ) INFORMATION FOR SEQ ID NO : 92 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 
50 (B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCAATG CCACAAACTC 50 

55 GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTC C ATA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTCA AT ATG AC TTC CCAGGTGACG 2 00 

ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCATTAAA TGGCGTAGCA 250 

GAATGGGAAG AAAAAATCCT TGAGTTAGCA AACCACTTAG ATACTTACAT 3 00 

60 CCCAGAACCA GAGCGTGCAA TTGACCAACC GTTCCTTCTT CCAATCGAAG 3 50 
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ATGTGTTCTC AATCTCAGGT CGTGGTACAG 
CGTGGTATCA TCCGTACTGG TGATGAAGTA 
TACAGCAAAA AC T AC TGTAA CGGGTGTTGA 
ACGAAGGTCG TGCAGGTGAA AACATCGGTG 
5 CGTGAAGAAA TCGAACGTGG TCAAGTATTA 
GCCACACACT GACTTCGAAT CAGAAGTTTA 
GTGGTCGTCA TACTCCATTC TTCAAAGGTT 
CGTACAACTG ACGTAACTGG TACTATCGAG 
GGTAATGCCA GGCGATAACA TCAAGATGAC 
10 TCGCGATGGA CCAAGGTTTA CGTTTCGCAA 



TAGTAACTGG TCGTGTAGAA 400 

GAAATCGTAG GTATCAAAGA 450 

AATGTTCCGT AAATTACTTG 500 

CATTATTACG TGGT AC C AAA 550 

GCGAAACCAG GTTCAATCAC 600 

TGTATTATCA AAAGAAGAAG 650 

ACCGTCCACA ATTCTATTTC 7 00 

TTACCAGAAG GCGTAGAAAT 7 50 

AGTAAGCTTA ATCCACCCAA 800 

830 



15 



2 ) INFORMATION FOR SEQ ID NO: 93 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases > 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

2 0 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 

3 0 TCTTAGTAGT AGCAGCAACA GACGGTCCAA TGCCACAAAC TCGTGAGCAC 50 

"ATCTTATTAG GTCGCCAAGT AGGTGTTCCA TACATCATCG T ATTC TT AAA 100 

CAAATGCGAT ATGGTTGACG ATGAAGAATT ATTAGAATTA GTTGAAATGG 150 

AAGTGCGTGA ACTTCTTTCA CAATATGACT TCCCAGGTGA TGACACGCCA 2 00 

GTAGTACGTG GTTCAGCGTT ACAAGCGTTA AACGGCGTAG CAGAGTGGGA 2 50 

35 AGAAAAAATT CTTGAATTAG CAAACCACTT AGATACATAC ATCCCAGAGC 3 00 

CAGAGCGTGC GATTGATAAA C C ATTC TT AT TACCAATCGA AGACGTATTC 3 50 

TCAATCTCAG GTCGTGGTAC AGTAGTAACA GGTCGTGTTG AGCGTGGTAT 400 

CATCAAAGCG GGTGAAGAAG TTGAAATCGT AGGTATCAAA GACACTGCGA 450 

AAACAACAGT AAC TGGCGTG GAAATGTTCC GTAAATTATT AGACGAAGGT 500 

40 CGTGCGGGTG AAAACGTTGG TGCATTATTA CGTGGTACAA AACGTGAAGA 550 

AATCGAACGT GGTCAAGTGT TAGCGAAACC AGGTACAATT ACACCACACA 600 

CAGACTTCGA ATCAGAAGTG TACGTATTAT CAAAAGAAGA AGGTGGTCGT 650 

CACACTCCAT TCTTCAAAGG TT AC CGTCC A CAATTCTACT TC C GT AC AAC 7 00 

TGACGTAACT GGTACTATTG AATTACCAGA AGGCGTAGAA ATGGTAATGC 750 

45 CAGGCGATAA CATCAAAATG ACAGTATCAT TAATCCACCC AATCGCGATG 800 

GACGAAGGTT TACGTTTTGC GATT 824 



50 2 ) INFORMATION FOR SEQ ID NO : 94 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

260 
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(A) ORGANISM: Haemophilus parainfluenzas 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

5 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTG 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 2 00 

10 ATACACCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTWGCA 25 0 

GAATGGGAAG AAAAAATCCT TGAATTAGCT AGCCACTTAG ATTCTTACAT 300 

TCCTGAGCCT CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 350 

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 400 

CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 450 

15 G AC TGC G AAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACYAAA 550 

C GTG AAG AAA TC GAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT C TG AAG TG T A CGT ATTATC C AAAGAAGAAG 650 

G TGGTC GTC A TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

2 0 CGTACAACTG ACGTAACCGG AACTATCGAA TTACCGGAAG GCGTGGAAAT 750 

GGTTATGCCT GGTGATAACA TCAAAATGAC TGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTTTCGCTA TCG 833 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 95 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
30 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : - Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus paraphrophilus 

(B) STRAIN: ATCC 29241 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

TGGTGCTATC TTAGTAGTAG CAGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CTTATTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTA 100 

TTCTTAAACA AATGCGACAT GGTAGATGAC GAAGAGTTAT TAGAATTAGT 15 0 

45 TGAAATGGAA GTTCGTGAAC TTCTTTCTCA ATATGACTTC CCGGGTGACG 200 

ATACGCCAAT CGTACGTGGT TCTGCATTAC AAGCGTTAAA CGGCGTTGCA 25 0 

GAATGGGAAG AAAAAATCCT TGAATTAGCA AACCACTTGG ATACTTACAT 3 00 

TCCTGAGCCA CAACGTGCTA TCGACCAACC GTTCCTTCTT CCAATCGAAG 3 50 

ACGTGTTCTC TATCTCCGGT CGTGGTACAG TAGTAACAGG TCGTGTTGAG 400 

50 CGTGGTATCA TCCGTACCGG TGATGAAGTT GAAATCGTAG GTATCAAACC 45 0 

G AC TGC G AAA ACTACCGTAA CCGGTGTTGA AATGTTCCGT AAATTACTTG 5 00 

AC GAAGGTCG TGCAGGTGAA AACATCGGTG CATTATTACG TGGTACCAAA 55 0 

CGTGAAGAAA TCGAACGTGG TCAAGTATTG GCTAAACCGG GTTCAATCAC 600 

TCCACACACT GATTTCGAAT CTGAAGTGTA CGTATTATCC AAAGAAGAAG 650 

5 5 GTGGTCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA ATTCTATTTC 700 

CGTACAACTG ACGTAACCGG TACTATCGAG TTACCGGAAG GTGTGGAAAT 750 

GGTAATGCCT GGCGATAACA TCAAAATGAC CGTATCCTTA ATCCACCCAA 800 

TCGCGATGGA CCAAGGTTTA CGTT 824 

60 
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2 ) INFORMATION FOR SEQ ID NO: 96 

( i ) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus segnis 

(B) STRAIN: ATCC 33393 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 



GCTATCTTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

GCACATCTTA TTAGGTCGCC AAGTAGGTGT TCCTTACATC ATCGTATTCT 100 

2 0 TAAACAAATG CGACATGGTA GATGACGAAG AGTTATTAGA ATTAGTTGAA 150 

ATGGAAGTTC GTGAACTTCT TTCTCAATAT GACTTCCCAG GTGATGATAC 200 

TCCAATCATT CGTGGTTC TG CATTACAAGC GTTAAACGGC GTAGCAGAAT 2 50 

GGGAAGAAAA AATCCTTGAA TTAGCTCAAG CATTAGATAC TTACATTCCT 3 00 

GAACCTGAGC GTGCAATCGA CCAACCGTTC CTTCTTCCAA TTGAAGACGT 3 50 

2 5 GTTCTCAATC TCTGGTCGTG GTACTGTAGT AACAGGTCGT GTAGAGCGTG 400 

GTATCATCCG TACCGGTGAT GAAGTTGAAA TCGTTGGTAT CAAACCAACT 450 

GCGAAAACAA CCGTAACCGG TGTTGAAATG TTC CGTAAAT TACTTGACGA 500 

AGGTCGTGCA GGTGAAAACA TCGGTGCATT ATTACGTGGT ACTAAACGTG 55 0 

AAGAAATCGA ACGTGGTCAA GTATTAGCGA AACCGGGTTC AATCACTCCA 600 

30 CACACTGACT TCGAATCTGA AGTGTACGTA TTATCTAAAG AAGAAGGTGG 65 0 

TCGTCATACT CCATTCTTCA- AAGGTTACCG TCCACAATTC TATTTCCGTA 7 00 

CAACTGACGT AACC GGT ACT ATC GAGTT AC CGGAAGGCGT GGAAATGGTT 750 

ATGCCTGGCG ATAACATCAA AATGAC CGTA TCCTTAATCC ACCCAATCGC 800 

GATGGACCAA GGTTTACG 818 

35 

2 ) INFORMATION FOR SEQ ID NO: 97 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 63 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvel 
50 (B) STRAIN: ATCC 13337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

CGGCGCTATC CTGGTTGTTG CTGCGACTGA CGGCCCTATG CCTCAGACTC 5 0 

55 GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTW CATCATCGTR 100 

TTCCTGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCWGGYGATG 2 00 

ATACTCCAAT CATC CGTGGT TCTGCTCTGA AAGCGCTGGA AGGYGMACCT 25 0 

GAGTGGGAAG CTAAGATCGT AGAACTGGCT GAAACTCTGG ATTCTTACAT 3 00 

60 YCCACARCCA GAACGTGCTA TCGAYAAGCC ATTCCTGCTG CCAATCGAAG 3 50 
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AC GTATTC TC TATCTC TGGC CGTGGTACWG 
CGCGGTATCG TTAAAGTTGG TGAAGAAGTT 
T AC CG TT AAA TCAACTTGTA CCGGCGTTGA 
ACGAAGGTCG TGCAGGCGAG AACGTTGGTG 
5 CG TGAAGAC A TCGAACGTGG TCAGGTTCTG 
GCCACACACC AAGTTCGAAT CAGAAGTTTA 
GCGGYCGTCA TACTCCGTTC TTCAAAGGCT 
CGTACAACTG ACGTGACCGG TACCATCGAA 
GGTAATGCCA GGC 



TTGTTACCGG TCGTGTAGAG 400 

GARATCGTTG GTATCAAAGA 450 

AATGTTCCGT AAACTGCTGG 500 

TTCTGCTGCG TGGTATCAAG 550 

GCTAAACCAG GYTCYATCAA 600 

TATTCTGAGC AAAGATGAAG 65 0 

ACCGTCCACA GTTCTACTTC 700 

TTGCCAGAAG GCGTGGAAAT 7 50 

763 



2 ) INFORMATION FOR SEQ ID NO : 98 

15 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klngella kingae 
25 (B) STRAIN: ATCC 23330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

CGGCGCAATC TTGGTATGTT CAGCAGCTGA CGGTCCTATG CCACAAACTC 50 

3 0 GCGAACACAT CTTGTTGGCT CGCCAAGTAG GTGTACCTTA TATCATCGTA 100 

TTCATGAACA AATGC GAC AT GGTCGATGAT GCTGAGTTGT TGGAATTGGT 150' 

TGAAATGGAA ATCCGTGACT TGTTGTCTAG CTACGATTTT CCAGGCGACG 200 

ATTGCCCAAT CGTTCAAGGT TCTGCATTGC GYGCATTGGA AGGCGACGCT 2 50 

GCATACAAAG AAAAAATCTT TGAATTGGCT GCTGCTTTGG ATAGCTACAT 3 00 

3 5 TCCTACTCCA GAACGTGCTG TTGATAAACC ATTCTTGTTG CCAATCGAAG 35 0 

ATGTATTCTC TATCTC TGGT CGTGGTACAG TAGTTACTGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTAGG C G AAGAGATT GAAATCGTTG GTTTGAAAGA 450 

CACGCAAAAA ACCACTTGTA CTGGCGTGGA AATGTTCCGC AAATTGTTGG 5 00 

ACGAAGGTCA AGCTGGTGAT AACGTTGGTG TATTATTGCG TGGTACGAAG 5 50 

40 CGTGAAGACG TTGAACGTGG TCAGGTATTG GCTAAACCAG GTTCTATCAC 6 00 

TCCGCACACT AAATTTGAAG CTGAAGTGTA TGTGTTGAGC AAAGAAGAAG 650 

GTGGCCGTCA TACGCCATTC TTCGCTAACT ACCGCCCACA ATTCTACTTC 7 00 

CGTACGACTG ACGTAACTGG TGC AG TTACT TTGTCTGAGG GTGTGGAAAT 7 50 

GGTTATGC C A GGCGAAAACG TGAAAATCAC TGTTGAGTTG ATTGCACCTA 8 00 

45 TCGCTAGGAA AACGGTTTGC GTTTTGCG 82 8 



50 



2 ) INFORMATION FOR SEQ ID NO: 99 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) - TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Klebsiella omithinolytlca 

263 
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(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

5 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

^ TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAGCTGGTT 150 

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACGCCGATC GTTCGTGGTT CCGCTCTGAA AGCGCTGGAA GGCGAAGCAG 250 

10 ACTGGGAAGC GAAAATCATC GAACTGGCTG GCTACCTGGA TTCTTACATC 300 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 5 00 

15 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCT AC TTCC 7 00 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 7 50 

2 0 GTCATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TTCACCCGAT 800 
CGCGATGGAC GATGGTCTGC GTTTCGCfc 



25 2 ) INFORMATION FOR SEQ ID NO : 100 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS: Double 

(D) "TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 100 



GATGCCGCAG ACTCGTGAGC ACATCCTGCT GGGTCGTCAG GTAGGCGTTC 50 

CGTACATCAT CGTGTTCCTG AACAAGTGCG ACATGGTTGA TGACGAAGAG 100 

CTGCTGGAAC TGGTTGAAAT GGAAGTTCGT GAACTTCTGT CTCAGTACGA 150 

TTTCCCGGGC GACGACACTC CGATCGTTCG TGGTTCTGCT CTGAAAGCGC 200 

45 TGGAAGGCGA CGCWGAGTGG GAAKCKAAAA TCATCGAACT GGCTGGCTTC 250 

CTGGATTCTT ACATTCCGGA AC C AGAGCGT GCGATTGACA AGCCGTTCCT 300 

GCTGCCGATC GAAGACGTAT TCTCCATCTC CGGTCGTGGT ACCGTTGTTA 350 

CCGGTCGTGT AGAGCGCGGT ATCATCAAAG TTGGCGAAGA AGTTGAAATY 400 

GTTGGTATYA AAG AC AC TGC TAAGTCTACC TGTACTGGCG TTGAAATGTT 450 

50 CCGCAAACTG CTGGACGAAG GCCGYGCTGG TGAGAACGTT GGTGTTCTGC 500 

TGCGTGGTAT CAAACGTGAA GAAATCGAAC GTGGTCAGGT ACTGGCTAAG 55 0 

CCGGGCTCTA TCAAGCCGCA CACCAAGTTC GAATCTGAAG TTTATATCCT 600 

GTCCAAAGAC GAAGGCGGCC GTCACACTCC GTTCTTCAAA GGCTACCGTC 65 0 

CGCAGTTCTA CTTCCGTACA ACTGACGTGA CTGGCACCAT CGAACTGCCG 7 00 

55 GAAGGCGTAG AGATGGTTAT GCCGGGCGAC AACATCAAAA TGGTTGTTA 7 49 



2 ) INFORMATION FOR SEQ ID NO : 101 

60 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 

15 TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAGC TGCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CGTTCGTGGT TCCGCTCTGA AAGCGCTGGA AGGCGAAGCA 2 50 

20 GACTGGGAAG CGAAAATCAT CGAACTGGCT GGCTACCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCTATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

TACTGCTAAR TCWACCTGTA CYGGCGTTGA AATGTTC CGC AAACTGCTGG 5 00 

25 ACGAAGGCCG TGC TGGTG AG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCTCTATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 65 0 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

3 0 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATTCACCCRA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 



35 2 ) INFORMATION FOR SEQ ID NO: 102 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGC G AC A TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAGATGGA 150 

AGTTCGTGAA CTGCTGTCTC AGTACGATTT CCCGGGCGAC GACACCCCGA 200 

55 TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCCACCTG GATACCTATA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 3 50 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTC GTGTAGA GCGCGGTATC 400 

ATCAAAGTAG GTGAAGAAGT TGAAATCGTT GGTATC AAAG AAACCGCGAA 450 

60 AACCACCTGT AC TGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 
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GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 
ATCGAACGTG GTCAGGTACT GGCTAAGCCG 
CAAGTTCGAA TCTGAAGTGT ACATCCTGTC 
ATACTCCGTT CTTCAAAGGC TACCGTCCGC 
5 GACGTGACTG GCACCATCGA ACTGCCGGAA 
GGGCGACAAC ATCAAAATGG TTGTTACCCT 
ACGACG 



GTGGTATCAA ACGTGAAGAA 5 50 

GGCACCATCA ACCCGCACAC 600 

CAAAGACGAA GGCGGCCGTC 650 

AGTTCTACTT CCGTACTACT 700 

GGCGTAGAGA TGGTAATGCC 7 50 

GATCCACCCG ATCGCGATGG 800 

806 



10 

2) INFORMATION FOR SEQ ID NO: 103 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 743 bases 
15 (B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

GCAGACTCGT GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA 5 0 

TCATCGTGTT CCTGAACAAA TGCGACATGG TTGATGACGA AGAGCTGCTG 100 

G AAC TGGTTG AGATGGAAGT TCGTGAACTG CTGTCTCAGT ACGATTTCCC 15 0 

3 0 GGGC G AC G AC ACTCCGATCG TTCGTGGTTC TGCTCTGAAA GCGCTGGAAG 200 

" % GCGACGCAGA GTGGGAAGCG AAAATCATCG AACTGGGTGG -GCACCTGGAT 250 

ACCTATATCC CGGAACCAGA GCGTGCGATT GACAAGCCGT TCCTGCTGCC 300 

GATCGAAGAC GTATTCTCCA TCTCCGGTCG TGGTACCGTT GTTACCGGTC 350 

GTGTAGAGCG CGGTATCATC AAAGTAGGTG AAGAAGTTGA AATCGTTGGT 400 

3 5 ATCAAAGAAA CCGCGAAAAC CACCTGTACT GGCGTTGAAA TGTTCCGCAA 45 0 

ACTGCTGGAC GAAGGCCGTG CTGGTGAGAA CGTAGGTGTT CTGCTGCGTG 500 

GTATCAAACG TGAAGAAATC GAACGTGGTC AGGTACTGGC TAAGCCGGGC 550 

ACCATCAACC CGCACACCAA GTTCGAATCT GAAGTGTACA TCCTGTCCAA 600 

AGACGAAGGC GGCCGTCACA CTCCGTTCTT CAAAGGCTAC CGTCCGCAGT 650 

40 TCTACTTCCG T AC T AC TG AC GTG AC TGGC A CCATCGAACT GCCGGAAGGC 7 00 

GTAGAGATGG TAATG C CGGG CGACAACATC AAAATGGTTG TTA 743 



45 2 ) INFORMATION FOR SEQ ID NO: 104 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. rhinos cleromat is 

(B) STRAIN: ATCC 13884 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
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TGGTTGTTGC 
CTGCTGGGTC 
ATGCGACATG 
TTCGTGAACT 
5 GTTCGTGGTT 
GAAAATCATC 
AGCGTGCGAT 
ATCTCCGGTC 
CAAAGTAGGT 

10 CCACCTGTAC 
GCTGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 

15 CGTGACTGGC 
GCGACAACAT 
GACGGTCTGC 



TGCGACTGAC 
GTCAGGTAGG 
GTTGATGACG 
RCTGTCTCAG 
CTGCTCTGAA 
GAACTGGCTG 
TGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
CAAAATGGTT 
GTTTCGCAA 



GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGATTTCC 
AGCGCTGGAA 
GCCACCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 



CGCAGACTCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGACGCAG 
TACCTATATC 
CGATCGAAGA 
CGTGTAGAGC 
TATCAAAGAA 
AACTGCTGGA 
GGTATCAAAC 
CACCATCAAC 
AAGACGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCCATCCGAT 



TGAGC ACATC 5 0 

TCCTGAACAA 100 

GAGATGGAAG 15 0 

C ACCCCGATC 2 0 0 

AGTGGGAAGC 250 

CCGGAACCAG 300 

CGTATTCTCC 35 0 

GCGGTATCAT 400 

ACCGCGAAAA 45 0 

CGAAGGCCGT 500 

GTGAAGAAAT 5 50 

CCGCACACCA 600 

CGGCCGTCAC 65 0 

GTACTACTGA 7 00 

GTAATGCCGG 750 

CGCGATGGAC 800 
819 



20 



30 
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2 ) INFORMATION FOR SEQ ID NO : 105 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 832 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA TGGCCCTATG CCACAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGYGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

40 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

AT ACTC CAAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGATGCA 2 50 

GAGTGGGAAG C GAAAATC AT CGAACTGGCT GGC TTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAACGTGCTA TCGATAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC CATC TCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

45 CGCGGTATCA TCAAAGTTGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

CACCGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTC TGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCTATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGACGAAG 650 

50 GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTTGAGAT 750 

GGTAATGCCA GGCGACAACA TCAAGATGGT TGTGACTCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA CC 832 



2 ) INFORMATION FOR SEQ ID NO : 106 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 83 0 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 



TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCTATG CCACAGACTC 5 0 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCTTA CATCATCGTG 100 

15 TTCCTGAACA AATGTG AC AT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTC TC A GTACGATTTC CCAGGCGACG 200 

ACACTCCTAT CGTTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGACGCT 250 

GAGTGGGAAG CAAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCGA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

2 0 ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

OGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

CACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

AC GAAGGC CG TGC TGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTTCTG GCTAAGCCAG GCTCCATCAA 6 00 

25 GCCGCACACC AAATTCGAAT CTGAAGTTTA CATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCAGAAG GC GTAG AG AT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 

30 



2 ) INFORMATION FOR SEQ ID NO: 107 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 
45 (B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACTCGTG 50 

50 AGCACATCCT GCTGGGTCGT CAGGTTGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGCGACGACA 200 

CGCCGATCGT TCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCTGAG 25 0 

TGGGAAGCGA AAATCATCGA ACTGGCGGGC TTCCTGGATT CTTACATCCC 300 

55 GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACCGTTG TTACCGGTCG TGTAGAACGC 400 

GGTATCATCA AAGTTGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAC AC 45 0 

CGCTAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC TGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAACGT 550 

60 GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGTT CTATCAAGCC 600 
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GCACACCAAG TTCGAATCTG AAGTGTACAT 
GCCGTCATAC TCCGTTCTTC AAAGGCTACC 
ACTACTGACG TGACTGGCAC CATCGAACTG 
AATGCCGGGC GACAACATCA AAATGGTTGT 
5 CGAAGGACGA AGGTCTGCGT TTCGCA 



TCTGTCCAAA GACGAAGGCG 65 0 

GTCCGCAGTT CTACTTCCGT 7 00 

CCGGAAGGCG TTGAGATGGT 750 
TACCCTGATC CACCCGATCG 800 

826 



10 



2 ) INFORMATION FOR SEQ ID NO: 108 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Lactobacillus casei subsp. easel 

(B) STRAIN: ATCC 393 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 

2 5 GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACATATCT TACTTTCACG 50 

TCAAGTTGGT GTTCCATACA TCGTTGTATT CATGAACAAA TGTGACATGG 100 

TTGACGATGA AGAATTACTA GAATTAGTTG AAATGGAAAT TCGTGATCTA 150 

TTAACTGAAT ATGAATTCCC TGGCGATGAC ATTCCTGTAA TCAAAGGTTC 2 00 

AGCTCTTAAA GCACTTCAAG GTGAAGCTGA CTGGGAAGCT AAAATTGACG 2 50 

3 0 AGTTAATGGA AGC TGTAGAT TCTTACATTC CAACTCCAGA AC GTG ATACT 3 00 

GACAAACCAT TCATGATGCC AGTTGAGGAT GTATTCTCAA TCACTGGTCG 350 

TGGAACAGTT GCAACTGGAC GTGTTGAACG TGGACAAGTT AAAGTTGGTG 400 

ACGAAGTAGA AGTTATCGGT ATTGAAGAAG AGAGCAAAAA AGTAGTAGTA 450 

AC TGG AGT AG AAATGTTCCG TAAATYACTA GATTACGCTG AAGCTGGCGA 5 00 

3 5 CAACATTGGC GCACTTCTAC GTGGTGTTGC TCGTGAAGAT ATCCAACGTG 5 50 

GTCAAGTATT AGC TAAACC A GGTTCGATTA CTCCACACAC TAACTTCAAA 600 

GCTGAAACTT ATGTTTTAAC TAAAGAAGAA GGTGGACGTC ACACTCCATT 650 

CTTCAACAAC TACCGCCCAC AATTCTATTT CCGTACTACT GACGTAACTG 7 00 

GTATTGTTAC ACTTCCAGAA GGTACTGAAA TGGTAATGCC TGGTGATAAC 750 

40 ATTGAGCTTG CAGTTGANCT AATTGCACCA ATCGCTATCG AAGACGGTAC 800 

TAA 803 



45 2 ) INFORMATION FOR SEQ ID NO: 109 

(i-) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

5 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis subsp. lactis 

(B) STRAIN: ATCC 19435 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 
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CGGTGCAATC CTCGTTGTTG CTGCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

TTCCTTAACA AGGCTGACCT TGTTGATGAT GAAGAATTGA TGGAACTCGT 15 0 

TGAAATGGAA GTTCGTGACC TCTTGAGCGA ATACGACTTC CCAGGTGACG 2 00 

5 ATATTCC TGT AATCGCTGGT TCAGCACTTG GTGCTTTGAA CGGTGAACCA 250 

CAATGGGTTG C T AAAGTTG A AGAATTGATG GACATCGTTG ATGAATACAT 3 00 

CCCAACTCCA GAACGCGACA CTGACAAACC ACTCCTTCTT CCAGTCGAAG 3 50 

ACGTATTCTC TATCACTGGT CGTGGTACAG TTGCTTCAGG ACGTATCGAA 400 

CGTGGTACTG TTAAAGTTGG TGACGAAGTT GAAATCGTTG GTATCAAAGA 450 

10 AGAAACTAAA AAAGCTGTTG TTACTGGTAT CGAAATGTTC CGTAAAACAC 500 

TTACTGAAGG TCTTGCTGGT GATAACGTCG GTGCACTTCT CCGTGGTATC 550 

CAACGTGACG AAATCGAACG TGGTCAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CACTCCACAC AAACTTTTCG AAGGTGAAGT TTAC GTATTG AGCAAAGAAG 65 0 

AAGGCGGACG TCACACTCCA TTCTTCGACA ACTACCGTCC TCAATTCTAC 700 

15 TTCCACACAA CTGACGTTAC TGGTTCAGTT AAACTTCCAG AAGGAACTGA 750 

AATGGTAATG CCTGGTGACA ACGTGCATAT CGACGTTGAA TTGATCCACC 800 

CAGTTGCGAT CGAACAAGGT ACT AC 82 5 



20 



30 



55 



2 ) INFORMATION FOR SEQ ID InG: 11 G 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercla adecarboxylata 

(B) STRAIN: ATCC 23216 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 110 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTTC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 15 0 

40 GAGATGGAAG TTCGTGAACT YCTGTCCCAG TACGACTTCC CGGGCGACGA 2 00 

CACCCCAATC GTTCGTGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 25 0 

AGTGGGAAGA GAAAATCATC GARCTGGCTG GCTACCTGGA TTCCTACATC 3 00 

CCAGAGCCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CTATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTAC CGT TGTTACCGGT CGTGTAGAGC 400 

45 GCGGTATCAT CAARGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAGGAC 450 

AC TGCTAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA AC GTTGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCYATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATCCTGTCYA AAGACGAAGG 650 

50 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

GTACKAC TG A CGTGACCGGT ACCATCGARC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCAAT 800 

CGCAATGGAC GATGGTCTGC GTTC 824 



2 ) INFORMATION FOR SEQ ID NO: 111 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 83 8 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella micdadei 

(B) STRAIN: ATCC 33218 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

CGGAGCGATA TTAGTAGTAT CAGCAGCGGA TGGCCCAATG CCTCAAACGA 5 0 

GAGAGCACAT ACTYTTATCC CGSCAGGTAG GTGTTCCCTA TATAGTAGTG 100 

15 TTCTTAAACA AAGCTGACAT GGTGGATGAT GCGGAGTTAT TAGAATTAGT 150 

TGAAATGGAA GTACGCGAYT TGTTGAGCAG CTATGAATTT CCAGGAGATG 200 

AGATCCCGAT TGTAGTTGGT TCAGCATTAA AAGCATTGGA AGGCGATACG 2 50 

AGTGATATAG GTGTACCAGC GATTGAGAAG TTAGTTGAGA CGATGGATTC 3 00 

TTATATACCT GAGCCGGTAA GAAACATCGA TAAAAGTTTC TTGTTACCGA 3 50 

2 0 TCGAAGACGT GTTCTCAATA TCTGGACGAG GAACAGTAGT AACAGGACGT 4 00 

ATCGAAAGCG GGATCATCAA AGTTGGTGAG GAAGTCGAGA TTGTTGGTAT 450 

ACGTGACACT CAAAAGACGA CATGCACAGG CGTTGAAATG TTC CGTAAAT 500 

TACTTGACGA AGGTCGAGCT GGAGACAACG TTGGTATATT GCTACGTGGT 550 

ACGAAGCGGG ATGAAGTTGA ACGCGGACAA GTATTAGCTA AGCCGGGAAG 600 

2 5 CATTAAACCG CATACTAAAT TTGAAGCTGA AGTGTATGTG TTGTCAAAAG 650 

ATGAAGGTGG ACGTCATACC CCATTCTTTA ACGGATATCG GCCTCAATTT 7 00 

TACTTCAGGA CCACAGACGT AACTGGTTCT TGTGATTTAC CTGARGGTAT 7 50 

AGAAATGGTA ATGCCAGGTG ATAACGTCAA GCTGATTGTT AGCTTACACT 800 

CACCGATTGC TATGGACGAA GGTTTGCGTT TTGCAATC 83 8 

30 ._. 



2 ) INFORMATION FOR SEQ ID NO: 112 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 
45 (B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

CGGAGCGATA CTGGTTGTAT CAGCAGCTGA TGGTCCTATG CCACAAACGA 50 

50 GGGAACACAT TCTATTGTCT CGCCAGGTAG GTGTTCCATA TATTGTTGTG 10 0 

TTCATGAACA AAGCGGATAT GGTTGATGAC CCTGAGTTAT TAGAGTTAGT 15 0 

GGAAATGGAA GTGCGAGATT TATTAAGCAG TTACGATTTC CCAGGGGATG 200 

ACATACCTAT TGTTGTTGGT TCAGCTTTGA AAGCATTGGA AGGTGAAGAC 25 0 

AGTGATATAG GCGTTAAGGC . TATTGAGAAA TTGGTTGAAA CAATGGATTC 3 00 

55 ATACATTCCT GAGCCAGTTA GAAACATAGA CAAGCCATTT TTGTTGCCGA 3 50 

TTGAAGACGT ATTTTCAATT TCTGGACGCG GAACAGTGGT AACTGGTCGT 400 

GTAGAGAGTG GAATTGTTAA AGTTGGTGAG GAAGTTGAAA TTGTTGGAAT 45 0 

AAGAGACACC CAAAAGACGA CTTGTACGGG TGTTGAGATG TTC CGTAAAT 500 

TACTTGATGA AGGTCGAGCT GGTGATAACG TTGGTGTGTT ATTACGAGGT 550 

60 ACGAAGCGAG ATGAAGTGGA GCGTGGACAG GTATTGGCGA AGCCAGGAAC 600 
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10 



CATCAAGCCA CACACCAAGT TTGAAGCAGA AGTGTATGTA TTATCCAAGG 650 

AAGAAGGCGG ACGTCACACT CCATTCTTTA ATGGATACCG TCCACAATTC 7 00 

TATTTCAGAA CCACTGACGT GACAGGTACT TGTGACTTGC CATCAGGAGT 7 50 

TGAAATGGTA ATGCCTGGAG ATAATGTGCA ATTAGTTGTT AGCTTGCATG 800 

CTCCGATTGC GATGGATGAA GGTTTAAGAT TCGCAATT 83 8 



2) INFORMATION FOR SEQ ID NO: 113 



( i ) SEQUENCE CHARACTERISTICS :. 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Lemlnorella grimontii 

(B) STRAIN: ATCC 33993 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 

2 5 GTGCAATCCT GGTAGTAGCA GCGACTGACG GCCCGATGCC TCAGACTCGC 50 

GAGCACATCC TGCTGGGTCG TCAGGTAGGC GTTCCGTACA TCATCGTATT 100 

CCTGAACAAG TGCGATATGG TTGATGACGA AGAGCTGCTG GAGCTGGTTG 150 

ARATGGAAGT TCGCGAACTG CTGTCTCAGT ACGACTTCCC GGGCGACGAC 200 

ACTCCGGTAG TCCGCGGTTC AGCGCTGAAA GCGCTGGAAG GCGAAGCCGA 250 

3 0 GTGGGAARCG AAAATCATCG AGCTGGCAGG CCMTCTGGAT ACTTATATCC 300 

CAGAACCTGA GCGTGCGATT GACAAiGCCGT TCCTGCTGCC KATCGAAGAC 3 50 

GTATTCTCTA TCTCCGGCCG TGGTACCGTT GTTACCGGTC GTGTAGAGCG 400 

CGGCATCATC AAAGTCGGTG AAGAAGTGGA AATCGTCGGT ATCAAAGATA 450 

CCACCAAGAC CACCTGTACC GGCGTTGAAA TGTTCCGTAA GCTGCTGGAC 500 

35 GAAGGCCGTG CGGGCGAGAA CGTGGGCGTT CTGCTGCGCG GTACCAAGCG 550 

TGACGAAATC GAACGTGGTC AAGTTCTGGC CAAGCCGGGC ACCATCACTC 600 

CTCACACCCA GTTCGTGTCA GAAGTGTATA TCCTGAGCAA GGATGAAGGC 650 

GGCCGTCATA CTCCGTTCTT CAAAGGCTAC CGTCCTCAGT TCTACTTCCG 700 

TACGACTGAC GTGACAGGCA CCATCGAACT GCCGGAAGGC GTAGAGATGG 750 

40 TAATGCCAGG CGACAACATT CAGATGACCG TAAGTCTGAT TGCGCCGATC 800 

GCAATGGACG AAGGTCTGCG CTTCGCAA 82 8 



45 2 ) INFORMATION FOR SEQ ID NO : 114 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

5 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminor&lla richardii 

(B) STRAIN: ATCC 33998 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 
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GCTATCCTGG TTGTTGCTGC GACTGACGGC CCAATGCCTC AGACTCGTGA 5 0 

GCACATCCTG CTGGGTCGCC AGGTAGGCGT TCCTTACATC ATCGTGTTCC 100 

TGAACAAGTG CGACATGGTT GATGACGAAG AGCTGCTGGA AC TGGTAG AA 15 0 

ATGGAAGTTC GTGAACTTCT GTCTCAATAC GACTTCCCGG GCGACGATAC 2 00 

5 GCCGGTTGTT CGCGGTTCAG CGCTGAAAGC GCTGGAAGGT GACGCYGAGT 250 

GGGAARCGAA AATCATTGAA CTGGCGGAAT CCTTRGATAC TTAYATTCCA 300 

GAGCCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCTA TCGAAGACGT 3 50 

TTTCTCTATC TCTGGCCGTG GTACTGTAGT CACCGGTCGT GTAGAGCGCG 400 

GCATCATCAA AGTTGGTGAA GAAGTGGAAA TCGTGGGAAT C AAAG AC AC C 450 

10 ACCAAGACCA CCTGTACTGG CGTTGAAATG TTCCGTAAGC TGCTGGACGA 5 00 

AGGCCGTGCA GGTGAGAACG TTGGTGTTCT GCTGCGYGGT ACTAAGCGTG 55 0 

ACGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCAGGCAC CATCACTCCT 600 

CACACAGAAT TCGTGTCAGA AGTGTATATC CTGAGCAAGG ATGAAGGCGG 65 0 

YCGTCATACT CCGTTCTTCA AAGGCTACCG TCCTCAGTTC TACTTCCGTA 7 00 

15 CGACTGACGT GACCGGCACC ATCGAACTGC CAGAAGGCGT AGAGATGGTA 7 50 

ATGCCAGGCG ATAACATCCA GATGGTAGTT ACGCTGATTG CCCCAATCGC 800 

GATGGACGAA GGTCTGCGCT TCGCAA 82 6 



20 



30 
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2 ) INFORMATION FOR SEQ ID NO: 115 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 843 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL _SOURCE: 

-(A) -ORGANISM: Leptospira interrogans 
(B) STRAIN: ATCC 23581 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 

TGCGGCGATT CTTGTAGTAT CCGCAACTGA CGGACCTATG CCACAAACAA 50 

AAGAACATAT CCTTCTTGCT CGTCAGGTAG GTGTTCCATA TGTAATTGTA 100 

TTCATTAACA AAGCAGATAT GCTTGCTGCT GACGAAAGAG CAGAAATGAT 15 0 

40 CGAAATGGTT GAGATGGACG TTCGTGAACT TCTCAATAAG TATAGCTTCC 2 00 

CAGGAGATAC AACTCCTATC GTTCATGGTT CTGCGGTAAA AGCACTTGAG 2 50 

GGCGATGAAT CTGAAATTGG GATGCCTGCA ATTCTCAAAT TGATGGAAGC 3 00 

TCTGGATACT TTCGTTCCAA ATCCAAAACG TGTAATCGAC AAACCTTTCC 3 50 

TTATGCCAGT AGAAGACGTT TTCTCGATCA CTGGTCGTGG AACTGTTGCA 400 

4 5 AC TGG AAG AG TGGAACAAGG TGTTTTGAAA GTGAACGACG AAGTTGAAAT 450 

TATCGGTATC CGCCCAACAA CAAAAACTGT TGTTACCGGT ATCGAAATGT 500 

TCAGAAAACT TCTCGATCAA GCGGAAGCTG GCGACAACAT CGGCGCTCTT 55 0 

CTTCGTGGAA CTAAAAAAGA AGAAATCGAA AGAGGGCAAG TTCTTGCGAA 600 

GCCAGGTTCT ATCACTCCTC ACAAAAAGTT TGCCGCTGAG GTGTATGTAT 65 0 

5 0 TAACTAAGGA TGAAGGCGGA CGTCATACTC CGTTTATCAA TAACTACCGT 7 00 

CCTCAGTTTT ACTTTAGAAC AACTGACGTA AC CGG AGTTT GTAACCTTCC 7 50 

TAATGGTGTC GAAATGGTTA TGCCTGGTGA TAACGTTTCT TTGACGGTTG 800 

AATTGATTAG CCCGATCGCA ATGGACAAGG GTCTTAAGTT CGC 84 3 



2 ) INFORMATION FOR SEQ ID NO: 116 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 832 bases 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Megamonas hypermegale 

(B) STRAIN: ATCC 25560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 



CGGTGCTATC CTCGTTGTTA GTGCTGCTGA TGGTCCTATG CCTCAGACTC 50 

GTGAACACAT CCTTCTCGCT CGTCAGGTTG GTGTTCCAGC TATCGTTGTA 100 

15 TTCCTCAACA AAGCTGACCA GGTTGATGAC CCTGAACTTC TCGAACTTGT 150 

TGAAATGGAA GTTCGTGAAC TTCTTTCCAG CTATGACTTC CCAGGCGATG 200 

ACGTTCCAGT AATCACTGGT TCCGCTCTTC AGGCTCTCGA AGGCGACGAA 250 

GAAGCTAAAA AGAAAATTCT TGAATTAATG GATGCTGTTG ATGATTACAT 3 00 

CCCAACTCCA ACACGTGACA CTGATAAACC TTTCTTAATG CCAGTTGAAG 3 50 

2 0 ACGTATTCAC AATTACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

eSTGGGGAAC TTAAACTTGG TGACAGCGTT GAA&TCGTTG GTCTTTCCGA 45 0 

TGAAAAGAAA TCCACTACTG TAACTGGTAT CGAAATGTTC CGCAAAATGC 500 

TTGATAGCGC TGTTGCTGGT GATAACATCG GTGCACTTCT TCGTGGTATT 550 

GACCGTAAAG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CTGGCACAAT 600 

2 5 TCATCCACAC AAAAAATTCA AAGCTCAGGT TTACGTATTA AC TAAAG AAG 65 0 

AAGGTGGACG TCATACTCCA TTCTTCTCCA ACTATCGTCC ACAGTTCTAT 700 

TTCCGTACTA CTGACGTTAC TGGTGTTGTA ACTCTTCCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACATTGAAAT GAGCATCGAA CTCATCACTC 800 

CAATCGCTAT TGAAAAAGGT CTTCGCTTCG CT 832 

30 



2 } INFORMATION FOR SEQ ID NO: 117 

3 5 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mitsuokella multaclda 
45 (B) STRAIN: ATCC 27723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 

TGGTGCTATC CTCGTCGTTT CCGCTGCTGA TGGCCCGATG CCGCAGACGC 5 0 

50 GTGAGCACAT CCTGCTCGCT CGCCAGGTCG GTGTTCCGGC AATCGTTGTC 100 

TTCCTCAACA AGGTTGACCA GGTTGACGAT CCGGAGCTCC TCGAGC TCGT 150 

CGAGATGGAA GTTCGCGAGC TGCTCTCCAG CTACGACTTC CCGGGCGATG 200 

ACATCCCTGT AATCGCTGGT TCCGCTCTGA AGGCCCTCGA AGGCGACGAA 2 50 

GAGCAGAAGA AGAACATCCT CAAGCTCATG GAAGCTGTCG ATGAGTACAT 3 00 

55 CCCGACGCCG GTCCGCGACA ACGCTAAGCC GTTCCTGATG CCGGTCGAGG 350 

ATGTCTTCAC GATCACGGGC CGTGGTACGG TTGCAACGGG CCGCGTTGAG 400 

CGTGGTGAGC TCAAGATGAA CGATACGGTT GAGATCGTTG GTCTGCAGGA 450 

CGAGCCGCGT CAGACGGTTG TCACGGGCAT CGAGATGTTC CGCAAGATGC 500 

TTGATTTCGC TGAGGCTGGC GATAACATCG GTGCTCTGCT CCGTGGTATC 55 0 

60 GACCGCAAGG AGATCGAGCG TGGCCAGGTT CTCGCAAAGC C GGGC AC GAT 600 
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TCATCCGCAC ACGAAGTTCA 
AAGGCGGCCG TCATACGCCG 
TTCCGCACGA CGGACGTAAC 
GATGGTTATG CCTGGCGATA 
5 CGATCGCTAT C G AG AAGGGC 



AGGCTCAGGT CTATGTCCTG 
TTCTTCACGA ACTATCGCCC 
TGGCGTAGTC AAACTGCCGG 
ACGTCGAGAT GGAAGTTGAG 



ACGAAAGAAG 65 0 

GCAGTTCTAC 700 
AAGGCACGGA 7 50 

CTCATCACCC 800 
820 
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2 ) INFORMATION FOR SEQ ID NO : 118 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Mobil uncus curtisii subsp . holmesii 

(B) STRAIN: ATCC 35242 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 

2 5 CGGCGCTATC CTCGTGGTGG CTGCTACTGA CGGTCCGATG GCTCAGACCA 5 0 

AGGAACACAT CCTGTTGGCT AAGCAGGTTG GCGTGCCCTC CATCCTGGTC 100 

GCTCTGAACA AGTGC GATTC TTCCGATGTG GACGAAGACA TGCTCGAAAT 150 

CGTCGAGGAC GAAATCCGCG ATGACCTGGA GAAGCAGGGC TTCGATCGTG 2 00 

ACTGCCC GAT TATCCACGTT TCCGCTCTGA AGGCCCTGGA AGGCGACCCC 2 50 

3 0 GAGTGGACCA AGAAGATTGA AGAGCTCATG GAAGCGGTCG ATACCTACAT 3 00 

TCCTGAGCCT GTTCGTGACC TCGACAAGCC GTTCTTGATG CCTATCGAAG 350 

ACGTCTTCAC CATTACTGGT CGCGGTACCG TAGTGACCGG TCGTGTGGAA 4 00 

CGCGGCAAGC TACCGTTGAA CGCCGAAGTG G AAATC GT AG GTATTCGTCC 450 

TACGCAAAAG AC C AC CGTTA CCGGTATCGA AATGTTCCAC AAGTCCATGG 500 

3 5 ACGAAGC CTA CGCCGGCGAG AAC TGTGGTC TGTTGCTGCG TGGCACCAAG 55 0 

CGTGAGGACG TTGAGCGCGG TCAGGTTGTC TGCATTCCTG GCTCCGTGAC 600 

CCCGCACACC AAGTTCGAGG GCAAGGTCTA CATCTTGAAG AAGGACGAAG 650 

GTGGACGTCA CAAGTCGTTC TACGACGGCT ACCGCCCGCA GTTC TTCTTC 7 00 

CGCACCACCG ACGTGACCGG TGTTATTCAC CTGCCCGAAG GCACCGAAAT 750 

40 GGTTATGCCT GGCGACACCA CCGAAATTAG CGTTGAGCTG ATTCAGCCTA 800 

TCGCTATGGA GGAAGGTCTC GGCTTCGCTA T 831 



45 2 ) INFORMATION FOR SEQ ID NO: 119 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 

(B) STRAIN: ATCC 35017 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 

275 
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GGTGCAATTC TGGTTGTTGC TGCAACTGAT 

TGAGCACATC CTGTTAGGTC GTCAGGTTGG 

TCCTGAACAA ATGTGACATG GTAGACGACG 

GAAATGGAAG TCCGTGAGCT GCTGTCTCAG 

5 CACTCCAGTA ATCCGTGGTT CAGCGCTGAA 

AGTGGGAAGC TAAAATCATT GAACTGGCAG 

CCAGAGCCAG AGCGTGACAT TGATAAGCCA 

CGTATTCTCA ATTTCAGGCC GTGGTACAGT 

GTGGTATCGT TAAAGTCGGT GAAGAAGTTG 

10 AC C GTGAAAA CAACATGTAC TGGCGTTGAA 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT 

GTGATGATAT CGAACGTGGT CAAGTATTGG 

CCGCATACAA CTTTCGAATC AGAAGTTTAC 

TGGCCGTCAT ACTCCATTCT TCAAAGGTTA 

15 GTACAACTGA CGTAACCGGT ACTATCGAAC 

GTAATGCCAG GTGATAACAT CAAAATGATC 

TGCAATGGAT GCAGGTCTGC GTTTT 



GGCCCTATGC CACAGACTCG 5 0 

CGTTCCATAC ATCATCGTTT 100 

AAGAGCTGTT AGAACTGGTT 15 0 

TACGATTTCC CAGGCGATGA 2 00 

AGCTCTGGAA GGCGAAGCTG 25 0 

AAGCACTGGA TTCTTATATC 3 00 

TTCCTGTTAC CAATCGAAGA 350 

TGTTACTGGT CGTGTTGAGC 400 

AAATCGTTGG TATCAAAGAT 450 

ATGTTCCGTA AACTGCTGGA 500 

TCTGCTGCGT GGTACTAAAC 550 

CTAAACCAGG TTCAATCACT 600 

ATCCTGAGCA AAGATGAAGG 65 0 

CCGTCCACAG TTCTACTTCC 7 00 

TGCCAGAAGG CGTTGAGATG 750 

GTTACTCTGA TCCACCCAAT 800 

825 



20 



30 



55 



2 ) INFORMATION FOR SEQ ID NGrl2G 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 827 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

. Z:Jvi)ORIG INAL- "SOURCE: 

(A) ORGANISM: Branhamella catarrhal Is 

(B) STRAIN: ATCC 43628 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120 

TGGTGCTATC TTGGTTGTTT CTGCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCATAT CCTACTATCT CGTCAGGTTG GTGTACCATA CATCATGGTA 100 

TTCATGAACA AGTGCGATAT GGTTGATGAT GAAGAGCTAC TAGAATTGGT 150 

40 TGAAATGGAA GTTCGTGAAC TTCTATCTGA CTATGATTTC CCTGGTGATG 200 

ATACCCCAAT CATCAAAGGT TCAGCACTAG AAGCATTGAA TGGTTCTGAT 250 

GGTAAATATG GCGAGCCTGC AGTTC TAG AA CTGCTAGACA CACTAGACAG 300 

C TATATC CCA GAGCCTGAGC GTGATATCGA TAAGTCATTC TTGATGCCAA 350 

TTGAAGATGT CTTCTCGATC TCAGGTCGTG GTACAGTTGT GACTGGTCGT 400 

45 GTTGAATCAG GTATTATTAA AGTTGGTGAT GAAATTGAAA TCATCGGTAT 450 

CAAACCAACT GCTAAAACCA CCTGTACTGG TGTTGAAATG TTCCGTAAAC 500 

TGTTAGACGA AGGTCGTGCA GGTGAGAACT GTGGTATCTT GTTGCGTGGT 550 

ACTAAGC GTG AAGAAGTTCA ACGCGGTCAA GTACTTGCAA AACCAGGTTC 600 

AATCACCCCA CATACTAAGT TTGATGCTGA AGTTTATGTA CTGTCAAAAG 650 

50 AAGAAGGTGG TCGTCACACC CCATTCTTAA ATGGCTATCG CCCACAGTTC 700 

TACTTCCGTA C C AC AG ATGT GACTGGTGCC ATCACTCTAC AAGAAGGTAC 750 

CGAAATGGTT ATGCCTGGTG ACAATGTTGA GATGAGTGTT GAGCTTATCC 800 

ACCCAATCGC CAGGATAAAG GTCTACG 827 



2) INFORMATION FOR SEQ ID NO: 121 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 806 bases 



276 
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10 



(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp. morgana 

(B) STRAIN: ATCC 25830 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 121 



CGGCGCTATC 
GTGAGCACAT 

15 TTCCTGAAC A 
TGAAATGGAA 
ACACGCCAAT 
GAGTGGGAAG 
CCCTGAGCCA 

2 0 ACGTATTCTC 
CGCGGTATCA 
TACTGCGAAA 
ACGAAGGCCG 
CGTGAAGAAA 

25 ACCACAYACC 
GTGGTCGTCA 
CGTACCACAG 
GGTAATGCCG 
TCGCAA 

30 • 



CTGGTTGTTG 
CCTGTTAGGT 
AATGTGACAT 
GTTCGTGAAC 
CGTTCGCGGT 
CTAARATCGT 
GAGCGTGCAA 
AATCTCCGGC 
TCAAGGTTGG 
ACCACCTGTA 
TGCMGGTGAG 
TCGAACGTGG 
AAATTTGAAT 
TACTCCATTC 
ACGTAACAGG 
GGCGACAACA 



CTGCAACTGA 
CGTCAGGTTG 
GGTTGATGAT 
TTCTGTCTCA 
TCAGCGCTGA 
TGAACTGGCA 
TTGACAAGCC 
CGTGGTACCG 
TGAGGAAGTT 
CCGGTGTTGA 
AACGTCGGTG 
TCAGGTTCTG 
CAGAAGTTTA 
TTCAAAGGYT 
TACTATCGAA 
TCAAAATGAT 



TGGCCCTATG 
GCGTTCCTTA 
GAAGAGCTGC 
GTACGATTTC 
AAGCACTGGA 
GGTTTCCTGG 
GTTCCTGCTG 
TTGTTAC CGG 
GAAATCGTGG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAACCAG 
TATTCTGAGC 
ACCGTCCACA 
CTGCCGGAAG 
CGTCACCCTG 



CCACAGACCC 50 

CATCATCGTA 100 

TGGAACTGGT 150 

CCTGGCGACG 2 00 

AGGCGAGCCA 250 

ATTCTTACAT 300 

CCAATCGAAG 3 50 

TCGTGTTGAG 400 

GTATCAAAGA 450 

AAAC TGCTGG 500 

TGGTACCAAG 550 

GTTCAATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 7 00 

GCGTTGAAAT 7 50 

ATCCACCCAA 800 
806 



2 ) INFORMATION FOR SEQ ID NO : 122 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 
45 (B) STRAIN: TB 299 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 122 

GGTGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CCCAGACCCG 50 

5 0 CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCTAC ATCCTGGTAG 100 

CGCTGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCTGCC CAGGAATTCG ACGAGGACGC 2 00 

CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT 250 

GGGTTGCCTC TGTCGAGGAA CTGATGAACG CGGTCGACGA GTCGATTCCG 3 00 

55 GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCGG TCGAGGACGT 3 50 

CTTCACCATT ACCGGCCGCG GAACCGTGGT CACCGGACGT GTGGAGCGCG 4 00 

GCGTGATCAA CGTGAACGAG GAAGTTGAGA TCGTCGGCAT TCGCCCATCG 45 0 

ACCACCAAGA CCACCGTCAC CGGTGTGGAG ATGTTC CGC A AGCTGCTCGA 500 

CCAGGGCCAG GCGGGCGACA ACGTTGGTTT GCTGCTGCGG GGCGTCAAGC 550 

60 GCGAGGACGT CGAGCGTGGC CAGGTTGTCA CCAAGCCCGG CACCACCACG 600 
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CCGCACACCG AGTTCGAAGG CCAGGTCTAC 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA 
GCACCACCGA CGTGACCGGT GTGGTGACAC 
GTGATGCCCG GTGACAACAC CAACATCTCG 
5 CGCCATGGAC GAAGGTCTGC GTTTC 



ATCCTGTCCA AGGACGAGGG 650 
CCGTCCGCAG TTCTACTTCC 7 00 

TGCCGGAGGG CACCGAGATG 7 50 

GTGAAGTTGA TCCAGCCCGT 800 

825 



2 ) INFORMATION FOR SEQ ID NO: 123 

10 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Neisseria cinerea 

(5/ STRAIN : ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 

2 5 CGGTGCGATC TTGGTATGTT CCGCAGCTGA CGGTCCTATG CCGCAAACTC 50 

GCGAACACAT CCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTTGACGAT GCCGAGCTGT TGGAGCTGGT 150 

TGAAATGGAA ATCCGTGACT TGCTGTCAAG CTACGACTTC CCAGGTGACG 200 

ACTGCCCGAT CGTACAAGGT TCTGCACTGA AAGCCTTGGA AGGCGACGCA 2 50 

3 0 GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 3 00 

CCCAACACCT GAGCGTGCAG TGGACAAACC TTTCTTGTTG CCTATCGAAG 350 

ACGTATTCTC TATTTCCGGT CGCGGTACAG TAGTAACCGG TCGTGTAGAG 400 

CGCGGTATCA TCCACGTTGG TGACGAGATC GAAATCGTAG GTCTGAAAGA 450 

AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

35 ACGAAGGTCA AGC TGGTG AC AACGTAGGTG TATTGCTGCG TGGTACTAAA 550 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTAC TATC AC 600 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 650 

GTGGTCGTCA CACTCCGTTC TTCGCTAACT ACCGTCCACA ATTCTACTTC 700 

CGTACTACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTAGAAAT 750 

40 GGTAATGCCG GGTGAGAACG TAACCATTAC TGTAGAACTG ATTGCGCCTA 800 

TCGCTA 806 



45 2 ) INFORMATION FOR SEQ ID NO: 124 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 25295 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 
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CGGCGCAATC TTGGTATGTT CCGCTGCYGA 
GCGAACACAT CCTGTTGGCC CGCCAAGTAG 
TTCATGAATA AATGCGACAT GGTTGAYGAT 
TGAAATGGAA ATC CGTG ACT TGCTGTCAAG 
5 ACTGCCCGAT CGTACAAGGT TCCGCACTGA 
GCTTACGAAG AAAAAATCTT CGAACTGGCT 
CCCGACACCT GAGCGTGCCG TGGACAAACC 
ACGTATTCTC TATCTCCGGC CGTGGTACAG 
CGCGGTATCA TCCACGTCGG TGACGAGATC 

10 AACCCAAAAA ACCACTTGTA CCGGTGTTGA 
ACGAAGGTCA AGCAGGTGAC AACGTAGGCG 
CGTGAAGAAG TGGAACGCGG TCAAGTATTG 
TCCTCACACC AAATTCAAAG CAGAAGTTTA 
GTGGTCGTCA TACTCCGTTC TTCGCTAACT 

15 CGTACTACCG ACGTAACCGG TGCGGTTACT 
GGTTATGCCT GGTGAGAACG TGGCCATCAC 
TCGCTATGGA AGAAGGTCTG CG 



CGGTCCTATG CCGCAAACTC 50 

GCGTACCTTA CATCATCGTG 100 

GCCGAACTGC TGGAACTGGT 150 

CTACGACTTC CCAGGCGACG 200 

AAGCCTTGGA AGGCGACGCA 250 

GCTGCATTGG ACAGCTACAT 300 

GTTCCTGTTG CCTATCGAAG 350 

TAGTAAC C GG TCGTGTAGAG 400 

GAAATCGTAG GTCTGAAAGA 450 

AATGTTCCGC AAACTGCTGG 500 

TATTGCTGCG CGGTACCAAA 550 

GCTAAACCGG GTAC CATC AC 600 

CGTATTGAGC AAAGAAGAGG 650 

ACCGTCCACA ATTCTACTTC 7 00 

TTGGAAGAAG GTGTAGAAAT 750 

TGTAGAACTG ATTGCACCTA 800 

822 



20 

2 ) INFORMATION FOR SEQ ID NO: 125 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 820 bases 
2 5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 1312 0 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125 

CGGCGCGACT TGGTATGTTC CGCAGCTGAC GGTCCTATGC CGCAAACCCG 50 

CGAACACATC CTGTTGGCTC GCCAAGTAGG TGTACC TTAC ATC ATC GT AT 100 

TCATGAACAA ATGCGACATG GTAGACGATG CCGAGCTGTT GGAACTGGTT 150 

40 GAAATGGAAA TTCGTGACTT GTTGTCAAGC TACGACTTCC CAGGCGACGA 2 00 

CTGCCCAATC GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGTGATGCTG 25 0 

CTTACGAAGA AAAAATCTTC GAATTGGCTG CTGCCTTGGA CAGCTACATC 300 

CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATCGAAGA 3 50 

CGTATTCTCT ATCTCTGGTC GTGGTACAGT AGTAACCGGT CGCGTAGAGC 400 

45 GCGGTATCAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450 

ACTCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AAC TGCTGG A 500 

CGAAGGTCAA GCAGGTGACA AC GT AGGCGT ATTGCTGCGT GGTACTAAAC 550 

GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAACCAGG TACCATCACT 600 

CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 650 

5 0 TGGTCGTCAC ACTCCATTTT TCGCTAACTA CCGTCCACAA TTCTACTTCC 700 

GTACTACCGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG C GT AGAAATG 750 

GTAATGCCAG G TG AG AAC GT AACCATTACT GTAGAACTGA TTGCGCCAAT 800 

CGCTATGGAA GAAGTCTGCG 820 

55 

2) INFORMATION FOR SEQ ID NO: 12 6 

( i ) SEQUENCE CHARACTERI STICS : 
60 (A) LENGTH: 83 0 bases 

279 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 49226 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126 



OPTGCAATCC TGGTATGTTC TGCTGCCGAC GGCCCTATGC CGCAAACCCG 50 

cgSca^tc CTGCTGGCCC GTCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

IS TCATGAACAA ATGCGACATG GTCGACGATG CCGAGCTGTT GGAACTGGTT 150 

gSItSSaaa TCCGCGACCT GCTGTCCAGC TACGACTTCC CCGGCGACGA 200 

CTGCCCGATC GTACAAGGTT CCGCACTGAA AGCCTTGGAA GGCGATGCCG 250 

CTTACGAAGA AAAAATCTTC G AACTGGC TA CCGCATTGGA C AGC T AC ATC 300 

CCGACTCCCG AGCGTGCCGT GGACAAACCA TTCCTGCTGC CTATCGAAGA 350 

20 CGTGTTC TCC ATTTCCGGCC GCGGTACCGT AGTCACCGGC CGTGTAGAGC 400 
GAGGTATCAT CCACGTTGGT- GACGAGATTG AAATC^«o 

ACCCAAAAAA CCACCTGTAC CGGCGTTGAA ATGTTCC GC A AACTGCTGGA 500 

GCGGGCGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

GTGAAGACGT AGAACGCGGT CAGGTATTGG CCAAACCGGG TACTATCACT 600 

25 CCTCACACCA AGTTCAAAGC AGAAGTGTAC GTATTGAGCA AAGAAGAGGG 650 

cSgCCgScAT ACCCCGTTTT TCGCCAACTA CCGTCCCCAA TTCTACTTCC 7 00 

GTACCACTGA CGTAAC CGGC GCGGTTACTT TGGAAAAAGG TGTGGAAATG 750 

G^tSSgG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 800 
CGCTATGGAA GAAGGTCTGC GCTTTGCGAT 



. 30 



2 ) INFORMATION FOR SEQ ID NO: 127 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 
45 (B) STRAIN: ATCC 2397 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 

CGGCGCAATC TTGGTATGTT CCGCCGCCGA CGGCCCTATG CCGCAAACCC 50 

50 GCGAACACAT TCTGTTGGCC CGCCAAGTAG GTGTACCTTA CATCATCGTA 100 

TTCATGAACA AATGCGATAT GGTCGACGAT GCCGAGCTGT TGGAAC TGGT 150 

TGAAATGGAA ATCCGCGACC TGCTGTCAAG CTACGACTTC CCAGGCGACG 200 

ACTGCCCAAT CGTACAAGGT TCCGCACTGA AAGCTTTGGA AGGCGATGCC 250 

GCTTACGAAG AAAAAATCTT CGAACTGGCT GCCGCATTGG ACAGCTACAT 300 

55 CCCGACTCCC GAGCGTGCCG TGGACAAACC GTTCCTGCTG CCTATCGAAG 350 

ACGTATTCTC CATCTCCGGC CGCGGTACGG TAGTAACCGG CCGTGTAGAG 400 

CGCGGTGTCA TCCACGTTGG CGACGAGATC GAAATCGTCG GTCTGAAAGA 450 

AACCCAAAAA ACCACCTGTA CCGGTGTCGA GATGTTCCGC AAACTGCTGG 500 

ACGAAGGTCA GGCAGGCGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 55 0 

60 CGTGAAGAAG TGGAACGCGG TCAGGTATTA GCCAAACCGG GTACCATCAC 600 

280 



CA 02307010 2000-05-19 



TCCGCACACC AAGTTCAAAG CAGAAGTGTA 
GCGGTCGTCA CACTCCGTTC TTCGCCAACT 
CGTACCACCG ACGTAACCGG CGCGGTTACT 
GGTAATGCCC GGTGAGAACG TAACCATTAC 
5 TCGCTATGGA AGAAGG 



TGTATTGAGC AAAGAAGAGG 650 
ACCGTCCGCA ATTCTACTTC 7 00 

TTGGAAGAAG GCGTGGAAAT 7 50 

TGTAGAACTG ATTGCGCCTA 800 

816 



2 ) INFORMATION FOR SEQ ID NO : 128 

10 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: a 
20 (A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: ATCC 13077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128 

2 5 CGGTGCAATC CTGGTATGTT CCGCAGCCGA CGGTCCTATG CCGCAAACCC 50 

GCGAACACAT CCTGCTGGCC CGTCAAGTAG GCGTACCTTA CATCATCGTG 100 

TTCATGAACA AATGCGACAT GGTCGACGAT GCCGAGCTGT TGGAACTGGT 150 

TGAAATGGAA ATCCGCGACC TGCTGTCCAG CTACGACTTC CCCGGCGACG 200 

ACTGCCCGAT CGTACAAGGT TCCGCACTGA AAGCC TTGGA AGGCGATGCC 2 50 

30 GCTTACGAAG AAAAAATCTT CGAATTGGCT GCTGCATTGG ACAGCTACAT 3 00 

CCCGACTCCC GAGCGTGCCG TGGACAAACC TTTCTTGTTG CCTATCGAAG 3 50 

ACGTATTCTC TATTTCCGGT CGTGGTACAG TAGTAACCGG TC GTGT AG AG 400 

CGCGGTATCA TCCACGTCGG TGACGAGATC GAAATCGTCG GTCTGAAAGA 45 0 

AACTCAAAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 5 00 

35 ACGAAGGTCA AGCAGGCGAC AACGTAGGCG TATTGCTGCG CGGTACCAAA 55 0 

CGTGAAGACG TAGAGCGTGG TCAAGTATTG GCTAAACCGG GTACAATCAC 600 

TCCTCACACC AAGTTCAAAG CAGAAGTATA CGTACTGAGC AAAGAAGAGG 6 50 

GCGGCCGCCA TACCCCGTTC TTCGCCAACT ACCGTCCCCA ATTCTACTTC 7 00 

CGTACCACCG ACGTAACCGG CGCGGTTACT TTGGAAGAAG GTGTGGAAAT 7 50 

40 GGTAATGCCG GGCGAGAACG TAACCATCAC CGTAGAACTG ATTGCGCCTA 800 

TCGCTATGGA AGAAGGTTTG CGCTTTGCGA T 8 3 1 



45 2 ) INFORMATION FOR SEQ ID NO: 129 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, mucosa 

(B) STRAIN: ATCC 19696 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9 
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CGGCGCAATC TTGGTATGTT CTGCTGCYGA 
GYGAACACAT CC TGTTGGC C CGTCAAGTAG 
TTCATGAACA AATGCGACAT GGTTGACGAT 
TGAAATGGAA ATCCGTGACT TGCTGTCAAG 
5 ACTGCCCGAT TGTACAAGGT TCTGC AC TG A 
GCTTACGAAG AAAAAATCTT CGAACTGGCT 
CCCGACTCCC GAGCGTGCCG TAGACAAACC 
ACGTATTCTC CATCTCCGGT CGTGGTACAG 
CGCGGTGTTA TCCACGTTGG TGACGAGATC 

10 AACCCAAAAA ACCACATGTA CCGGTGTTGA 
ACGAAGGTCA AGCCGGTGAC AACGTAGGCG 
CGTGAAGAAG TGGAACGCGG TCAAGTATTG 
TCCGCACACC AAATTCAAAG CAGAAGTGTA 
GTGGTCGTCA TACTCCGTTC TTCGCTAACT 

15 CGTACTACCG ACGTAAC CGG TGCGGTTACT 
GGTTATGCCT GGTGAGAAYG TAGCCATYAC 
TYGCTATGGA AGAAG 



CGGTCCTATG CCGCAAACCC 5 0 

GYGTACCTTA CATCATCGTG 100 

GCCGAAYTGT TGGAACTGGT 150 

CTACGACTTC CCTGGYGACG 2 00 

AAGCCTTGGA AGGCGATGCC 2 50 

GCCGCATTGG ACAGCTACAT 3 00 

GTTCCTGTTG CCTATCGAAG 3 50 

TAGTAACCGG CCGTGTAGAG 400 

GAAATCGTAG GTC TG AAAG A 45 0 

AATGTTCCGC AAACTGCTGG 500 

TATTGCTGCG CGGTACCAAA 55 0 

GCTAAACCGG GTAC CATCAC 600 

CGTATTGAGC AAAGAAGAGG 650 

ACCGTCCTCA ATTCTACTTC 7 00 

TTGGAAGAAG GTGTAGAAAT 75 0 

TGTAGAACTG ATTGCGCCTA 800 

815 



25 



30 



20 

2) INFOF.MATION FOR SEQ ID NO: 13 0 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN : ATCC 9913 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 130 

GGCGCAATCT TGGTATGTTC CGCTGCTGAC GGTCCTATGC CGCAAACCCG 5 0 

CGAACACATC CTGTTGGCCC GCCAAGTAGG CGTACCTTAC ATCATCGTGT 100 

TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150 

40 GAAATGGAAA TCCGTGACTT GCTGTCAAGC TACGACTTCC CTGGTGACGA 200 

CTGCCCGATC GTACAAGGTT CTGCACTGAA AGC C TTGGAA GGCGACGCCG 25 0 

CTTACGAAGA AAAAATCTTC GAACTGGCTG CTGCATTGGA CAGCTACATC 3 00 

CCGACTCCTG AGCGTGCCGT GGACAAACCG TTCCTGTTGC CTATTGAAGA 350 

CGTATTCTCC ATCTCCGGTC GCGGTACCGT AGTAACCGGC C GTGT AG AGC 400 

45 GCGGTGTTAT CCACGTTGGT G AC G AG ATTG AAATCGTAGG TC TG AAAG AA 450 

ACCCAAAAAA CCACTTGTAC CGGTGTTGAA ATGTTCCGCA AACTGC TGG A 500 

CGAAGGTCAA GCCGGTGACA ACGTAGGCGT ATTGCTGCGC GGTACCAAAC 550 

GTGAAGAAGT GGAACGCGGT CAAGTATTGG CTAAACCGGG TACCATCACT 600 

CCTCACACTA AATTCAAAGC AGAAGTTTAC GTATTGAGTA AAGAAGAGGG 650 

5 0 TGGTCGTCAT ACTCCGTTCT TCGCTAACTA CCGTCCTCAA TTCTACTTCC 700 

GTACTACCGA CGTAACCGGC GCGGTTACTT TGGAAGAAGG TGTAGAAATG 7 50 

GTTATGCCTG GTGAGAACGT AGC CATC ACT GTAGAACTGA TTGCACCGAT 800 

CGCTATGGAA GAAGGTCTGC GCTTTGCGA 8 ^ y 



55 



2) INFORMATION FOR SEQ ID NO: 131 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 814 bases 

282 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

{ ii ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria subflava 

(B) STRAIN: ATCC 14221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 



CGGCGCGACT TGGTATGTTC CGCAGCTGAT GGTCCTATGC CTCAAACTCG 5 0 

cgScacatc CTGTTGGCTC GCCAAGTAGG TGTACCTTAC ATCATCGTAT 100 

IS TCATGAACAA ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT 150 

gSI?Sgaaa TCCGTGACCT GTTGTCAAGC TACGACTTCC CAGGCGACGA 200 

CTGCCCAATC GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGTGACGCTG 250 

GTTACGAAGA GAAAATCTTC GAATTGGCTG CTGCTCTGGA CAGCTACATC 3 00 

CCAACACCTG AGCGTGCTGT GGACAAACCT TTCTTGTTGC CTATCGAAGA 3 50 

2 0 CGTATTCTCT ATCTCTGGCC GTGGTACAGT AGTAACTGGT CGTGTAGAGC 400 

GCGGTATCAT CCACGTTGGT GACGAGATCG AAATCGTAGG TCTGAAAGAA 450 

ACCCAAAAAA CCACTTGTAC CGGCGTTGAA ATGTTC CGC A AACTGCTGGA 500 

CGAAGGTCAA GCTGGTGACA ACGTAGGCGT ATTGCTGCGT GGTACCAAAC 550 

GTGAAGACGT AGAGCGTGGT CAAGTATTGG CTAAACCAGG TACCATTACT 600 

25 CCTCACACCA AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG 65 0 

~" TGGTCGTCAC ACTCCATTCT TCGCTAACTA CCGTCCACAA TTCTACTTCC 7 00 

GTACTACTGA CGTAACTGGT GCAGTTACTT TGGAAGAAGG CGTAGAAATG 7 50 

GTAATGCCAG GTGAGAACGT AACCATTACT GTAGAACTGA TTGCGCCTAT 800 

CGCTATGGAA GAAG 814 

30 



2 ) INFORMATION FOR SEQ ID NO: 132 

3 5 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria weaver i 
45 (B) STRAIN: ATCC 51223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132 

GCCATCTTGG TATGTTCTGC TGCTGACGGT CCTATGCCGC AAACCCGTGA 50 

50 GCACATCCTG TTGGCTCGTC AAGTAGGTGT ACCCTACATC ATCGTATTCA 100 

TGAACAAATG CGATATGGTT GATGATGCAG AGCTGCTGGA ATTGGTAGAA 150 

ATGGAAATCC GTGATCTGCT GAGCAGCTAC GATTTCCCTG GCGATGATTG 200 

YCCAATCGTG CAAGGTTCTG CTTTGAAAGC TTTGGAAGGT GATGCCGCTT 250 

ACGAAGAAAA AATCTTTGAA TTAGCTGCTG CATTGGATAG CTATATTCCA 300 

55 ACWCCTGAGC GYGCTGTTGA TAAACCATTC CTGTTGCCGA TTGAAGATGT 350 

ATTCTCAATT TCAGGTCGTG GTACAGTAGT AACTGGTCGT GTAGAGCGCG 400 

GTATTATTCA CGTAGGCGAT GAAATTGAAA TTGTAGGTTT GAAAGARACY 450 

CAAAAAACTA CTTGTACCGG CGTTGAAATG TTCCGTAAAT TGCTGGATSA 500 

AGGTCAGGCT GGTGATAACG TAGGCGTATT GTTGCGTGGT ACCAAACGTG 55 0 

60 AAGACGTTGA GCGTGGTCAA GTATTGGCTA AGCCTGGTWC TATTACTCCG 600 
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CAYACCAAAT TCAAAGCAGA 
TCGTCATACT CCGTTCTTCG 
CTACAGACGT TACCGGTGCK 
ATGCCTGGTG AGAAYGTTGC 
5 KATGGAAGAA GGYTGCGT 



RGTKTATGTW TTGAGYAAGG 
CTAACTATCG TCCGCAATTC 
GTRACTTTAG AAGAAGGTGT 
CATTACTGTW GARYTGATYG 



AAGAAGGCGG 650 
TATTTCCGTA 7 00 

GGAAATGGTA 7 50 

CTCCGATTGC 800 
818 



2 ) INFORMATION FOR SEQ ID NO : 133 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
on (A) ORGANISM: Ochrobactrum anthropi 

(3) STRAIN: ATCC 4518-8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

9 r CCGCGC AATT CTGGTTGTTT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTCGCT CGTCAGGTTG GCGTTCCGGC AATCGTCGTG 100 

?tcctSSca AGTGCGACCA GGTTGACGAT GCAGAACTGC TCGAACTGGT 150 

SSctooaa GTTCGCGAAC TTCTGTCGAA ATACGATTTC CCGGGCGACG 2 00 

Sgttccgat CATCAAGGGC TCGGCTCTTG CTGCTCTGGA AGATTCTTCG 2 50 

3 0 ^ggSctgg GCGAAGACGC CGTTCGTTCG ctgatggccg ctgttgacga 3 00 

CTACATTCCG ACCCCGGAAC GTCCGATCGA CCAGCCGTTC CTGATGCCGA 3 50 

tcgSgacgt TTTCTCGATC TCGGGCCGTG GTACGGTTGT GACGGGTCGC 400 

gSSagcgcg GTATCGTCAA GGTTGGTGAA GAAGTTGAAA TCGTCGGCAT 450 

SS^SS GCGAAGACGA CGGTAACCGG CGTTGAAATG TTCCGCAAGC 500 

35 TGCTCGAYCA GGGCCAGGCT GGCGACAACA TCGGCGCTCT GATCCGCGGC 550 

GTTGGCCGTG AAGACGTTGA ACGCGGCCAG GTTCTCTGCA AGCCGGGTTC 600 

?S?G^AGCCG CACACCAAGT TCAAGGCAGA AGCCTACATT CTGACCAAGG 650 

3SS5qOTQQ CCGTCATACG CCGTTCTTTA CGAACTACCG TCCGCAGTTC 700 

TACTTCCGCA CGACGGACGT GACCGGTGTT GTCACGCTGC CGGAAGGCAC 750 

40 ggSItogtt ATGCCTGGCG ACAACGTCGC TATGGACGTC ACCCTGATCG 800 

TGCCGATCGC CATGGAAGAG AAGCTCCGCT TCGCTA yjb 



45 2 ) INFORMATION FOR SEQ ID NO: 134 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea agglomerans 

(B) STRAIN: ATCC 27155 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 134 
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CCTGGTTGTT GCTGCGACTG ATGGCCCAAT GCCACAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT ACATCATCGT GTTCCTGAAC 100 

AAGTGTGACA TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAGATGGA 150 

^?Sg?gac CTGCTGTCAC AGTACGACTT CCCAGGCGAT GACACCCCGA 2 00 

5 TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGTTCC TGAGTGGGAA 2 50 

g??SaStcg TTGAGCTGGC TGAACACCTG GACAACTACA TCCCGGATCC 300 

AGTCCGTGCG ATCGACATGC CGTTCCTGCT GCCAATCGAA GACGTATTCT 350 

CAATCTCTGG CCGTGGTACC GTTGTT AC CG GTCGTGTTGA GCGCGGCATC 400 

GTTAAAGTCG GCGACGAAGT TGAAATCGTG GGTATCAAAG ATACTGCGAA 450 

10 ATCAACCTGT ACCGGTGTTG AGATGTTCCG TAAGCTGCTG GACCAGGGTC 500 

AGGCAGGCGA AAAC TGTGGT GTTCTGCTGC GCGGTATCAA GCGTGAAGAC 550 

ATCCAGCGTG GCCAGGTTCT GGCTAAGCCA GGCTCAATCA AGCCGCACAC 600 

CCAGTTCGAG TCAGAAGTTT ACGTTC TGTC TAAAGACGAA GGTGGCCGCC 65 0 

ATACTCCGTT CTTCAAAGGC TATCGTCCAC AGTTCTACTT CCGTACAACT 7 00 

15 GATGTAACCG GTTCAGTAGA GCTGCCAGAA GGCGTTGAGA TGGTCATGCC 750 

AGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCA ATCGCAATGG 800 
ACGAA 



20 

2 ) INFORMATION FOR SEQ ID NO: 135 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 5 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30- 

- (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea dlspersa 

(B) STRAIN: ATCC 14589 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 

CGCGATCCTG GTTGTTGCTG CGACTGATGG CCCAATGCCT CAGACCCGTG 50 

AGCACATCCT GCTGGGCCGT CAGGTTGGCG TTCCTTACAT CATCGTGTTC 100 

CTGAACAAGT GTGACATGGT TGATGACGAA GAGCTGCTGG AAC TGGTTG A 150 

40 GATGGAAGTT CGCGATCTGC TGTCTCAGTA CGACTTCCCA GGCGACGATA 2 00 

CCCCAATCGT ACGCGGTTCT GCGCTGAAAG CGCTGGAAGG CGACGCTGAG 2 50 

TGGGAAGCGA AAGTCGTTGA GCTGGCTGGT CACCTGGATA CTTACATTCC 3 00 

AGATCCAGTA CGTGCTATCG ATCTGCCGTT CCTGCTGCCA ATCGAAGACG 3 50 

TATTCTCAAT CTCTGGCCGT GGTACCGTTG TTACCGGTCG TGTTGAGCGC 400 

4 5 GGCATCGTGA AAGTGGGCGA CGAAGTAGAA ATCGTTGGTA TCAAAGCGAC 4 50 

TGCCAAGTCT ACCTGTACCG GTGTTGAAAT GTTCCGCAAA CTGCTGGACC 500 

AGGGTCAGGC AGGCGAGAAC TGTGGTGTTC TGCTGCGCGG TATCAAGCGT 550 

GAAGAGATCC AGCGTGGTCA GGTTCTGGCT AAGCCAGGCA CCATCAAGCC 600 

AC AC AC C AAG TTCGTATCAG AAGTGTACGT ACTGTCTAAA GACGAAGGCG 650 

5 0 GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCACAGTT CTACTTCCGT . 7 00 

ACYACTGATG TGACCGGCAM CATMGAACTG CCAGAAGGCG TTG AGATGGT 750 
AATGCCAGGC GACAACATCA AAATGRCCGT TGAGCTGATC CACCCAATCG 800 
CGATGGACCA GGGTCTGCGT TTCGC 825 

55 

2 ) INFORMATION FOR SEQ ID NO : 13 6 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 762 bases 

285 
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10 



15 



20 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136 

CACAAACACG TGAGCACATC CTTTTAGGTC GCCAAGTAGG CGTTCCTTAC 50 

ATCATCGTAT TCTTAAACAA ATGCGACATG GTGGATGATG AAGAATTATT 100 

agSttagtt GAAATGGAAG TGCGTGAACT TCTTTCTCAA TATGATTTCC 150 

CAgStGATGA TACACCAATC GTACGTGGTT CAGCGTTACA AGCGTTAAAC 200 

rPYGTAGCTG AGTGGGAAGA GAAAATTCTT GAGTTAGCCA ACCACTTAGA 25 0 

SStJSS CCAGAGCCAC AACGTGCAAT cgaccaaccg ttccttcttc 300 

OCATTGAAGA CGTGTTCTCA ATTTCTGGTC GTGGTACAGT AGTAACAGGT 35 0 

GTGGTATCAT CCGTACAGGT GAAGAGGTTG AAATTGTTGG 400 

SSS « 7SSS5SZ |°° 

GGTACTAARC GTGAAGAAAT CGAACGTGGT CAAGTGTTAG CGAAACCGGG 550 
t??aa?y!cg ccacacactg attttgaatc AGAAGTTTAC GTGTTATCAA 600 
25 AAGAAGAAGG TGGTCGTCAT ACACCATTCT TCAAAGGTTA CCGTCCACAG 650 
TTCTACTTCC GTACAACGGA CGTAACAGGT ACAATCGAAT TACCGGAAGG 700 
TGTTGAGATG GTGATGCCTG GTGATAACAT CAAGATGACT GTAAGTTTGA 750 
TTCACCCAAT CG 

30 

2 ) INFORMATION FOR SEQ ID NO: 137 

{ i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 



TGGAGCTATC TTAGTTGTAT CAGCAGCGGA TGGACCAATG CCACAAACAA 50 

GAGAACACAT CTTATTATCA AGACAAGTAG GAGTACCATA TATCGTAGTA 100 

50 TATTTGAATA AAGCAGATAT GGTAGAAGAT GAAGAATTAT TAGAATTAGT 150 

aSaaItggaa GTAAGAGAAT TACTATCTGA ATATGGATTC CCAGGAGATG 200 

AAATTCCAAT CATAACAGGA TCATCCTTAG GAG C ATTAAA TGGAGAACAA 250 

AAATGGATAG ATCAAATCAT GGCATTGATG AAAGCCGTAG ATGAATATAT 300 

TCCAACACCG GAAAGAGCAG TAGATCAACC ATTCTTGATG CCAATCGAAG 350 

55 ACGTATTTAC AATTACAGGA AGAGGAACTG TAGTAACAGG AAGAGTTGAA 400 

AGAGGAGTTG TAAAAGTWGG AGAAGAAGTT GAAATCGTAG GAATCAAAGC 450 

GACAACAAAG AC AAC TTGTA CYGGAGTAGA AATGTTC CGA AAATTATTGG 500 

ATCAAGGACA AGCAGGAGAT AACATCGGAG CTTTATTRAG AGGAACCAAG 550 

aaagaagatg" Jagaaagagg ACAAGTATTG GCAAAACCAG GAACAATTCA 600 

TCCTCATACA AACTTCAGTG GAGAAGTATA TGTATTGACA AAAGAAGAAG 650 

286 
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GAGGAAGACA TACTCCATTC 
AG AAC C AC AG ATATTACAGG 
GGTAATGCCR GGAGATAATA 
TTGC AATGG A AACAGGATTA 



TTCTCAGGAT ACAGACCACA 
AGCAGTAACA TTACCAGAAG 
TCACAATGAC AGTAGAATTG 
CGATTTGCAA TT 



ATTTTACTTT 7 00 

GAG T AG AAAT 7 50 

ATTCACCCAA 800 
832 



2 ) INFORMATION FOR SEQ ID NO: 138 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
{ D ) TOPOLOGY : Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus asaccharolyticus 

20 (B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138 

TAG T ATGTTC AGCAGCAGAY GGTCCAATGC CACAAACAAG AGAACACATT 50 

25 25ScTAGCAA GACAAGTTGG TGTACCAAAG ATAGTAGTAT TCCTAAACAA 100 

AGAAGACCAA GTAGACGATC CAGAACTAAT TGAATTAGTA GAGATGGAAA 150 

TCAGAGACCT ACTATCAGAA TATGACTTCG AYGGAGACAA CACACCAATC 200 

CTAGTAGOAT CAGCATTAAA AGCCCTAGAC GATCCAGACG GAGAATGGGG 250 

AGACAAAATC GTAAAACTAA TGGAAGMAGT AGACGAATAC ATCCCAACAC 3 00 

3 0 CAGTAAGAGA TACAGAACAC CCATTCCTAA TGCCAATCGA AGACRTATTC 3 50 

3 55St?acag GAAGAGGAAC AGTAGCAACA GGAAGAGTAG AACAAGGTGT 400 

AGTAAAAGTA GGMGACACAG TAGAACTAGT AGGCTTAACA GACGAAAGCA 450 

GACAAGTAGT AGTAACAGGT GTAGAAATGT TTAGAAAACA ACTAGACCTA 500 

GCAGAAGCMG GAGACAACAT TGGAGCCCTA CTAAGAGGAG TACAAAGAGA 550 

35 AGAAATCCAA AGAGGACAAG TACTAGCAGC ACCAGGAACA ATCAAACCAC 6 00 

ACACAAAATT TGAAGCAGAA GTAT AC GT AC TAACAAAAGA AGAAGGTGGA 650 

AGACACACAC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTCAGAAC 7 00 

AAC AG AC GT A ACAGGAGACA TCCAACTAGC AG AC GG AGTA GAAATGGTAA 75 0 

TGCCAGGAGA CAACTCAACA TTTACAGTAA CACTAATCAC ACCAATCGCA 800 
40 ATGGACGAAG GACTAAGATT CGC 



2 ) INFORMATION FOR SEQ ID NO: 13 9 

45 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (a) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 

60 CTATCATCGT AGTATCTGCA GCAGACGGTC CAATGCCCAA ACAGAGAGAA 

287 
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CACATCCTAC TAGCAAGACA AGTAGGCGTT CCAAAAATCG CAGTATTCCT 100 

S^cSagaa GACCAAGTAG ACGATCCAGA ACTAATCGAA TTAGTAGAAA 150 

TGGAAATCAG AGACCTACTT TCAGAATACG ACTTCGATGG AGACAACGCT 200 

SSwGTAO TAGGATCTGC TCTTAAATCA CTAGAAGAAG GCGGAGAAGG 250 

c CCCATGGTCA G AC AAAATC C TTGACCTAAT GGCACAAGTA GACGAATAC T 3 00 

Sa£a??cc agaaagagac AACGACCAAC CATTCCTAAT GCCAGTAGAA 350 

SaSotSItga CAATCTCAGG ACGTGGAACA GTAGCAACAG GAAGAGTTGA 400 

Sgaggaaca ctaaaagttg GTGATACAGT AGAAATCGTA GGACTAACAG 450 

^SS?aSS agaaacagta GTAACTGGAG TAGAAATGTT CCACAAATCM 500 

in otaSaccaag cagaatctgg AGATAACGTA GGACTACTAC TAAGAGGAGT 550 

SSSSgat caaatctcaa gaggacaagt ACTAGCAAAA CCAGGWTCAG 600 

TAAACCCACA CACAGAATTC G AAGGTC AAG TATACGTACT AACAAAAGAA 650 

g^ctgSac gtcacacacc attcttcagt ggatatagac cacaattcto 700 

CTTTAGAACA ACAGACGTAA CAGGAGACAT CGAACTAGAA GAAGGCGTAG 750 

15 SItcg^at GCCAGGAGAC aacgcaacat TCAAAATCAC ACTCCAAAAA 800 

CCAATCGCTC TAGAAGAAGG ACTAAGATTC GC " z 



20 2 ) INFORMATION FOR SEQ ID NO: 140 



25 



30 



35 



40 



45 



50 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(AL _ ORGANISM : Porphyromonas asaccharolytica 
~(B) STRAIN: ATCC 252 60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 

CGGTGCTATC ATCGTAGTTG CTGCAACTGA TGGTCCTATG 
GTGAGCACAT CCTACTAGCA CGTCAGGTCA ACGTACCTCG 
TTTATGAACA AGTGCGACCT TGTTGATGAC GAGGAGATGC 
AGAGATGGAT ATGCGTGAGC TACTAAGCTT CTATGAC TTT 
ACACTCCTGT CATCCGTGGT TCTGCTCTTG GTGCTCTCAA 
AAGTGGGTAG AGAAGGTTAT GGAGCTCATG GAGGCTGTAG 
CCCACTACCT GAGCGCGACA TCGACAAGCC TTTCCTAATG 
ACGTATTCTC TATCACAGGT CGTGGTACTG TCGCTACTGG 
ACTGGTGTCG TT AAGGTC AA CGATGAGGTT CAGATCATCG 
TGAGGGTAAG AAGAGCGTCG TAACTGGCGT GGAAATGTTC 
TTGATGAGGG TGAAGCTGGT GATAACGTAG GTCTCCTACT 
GACAAGGACG AGATCAAGCG CGGTATGGTC CTAGCACACC 
CAAGCCTCAC GATCACTTCA AGGCTGAGGT CTATATCCTG 
AGGGTGGTCG TCACACACCA TTCCACAACA AGTACCGTCC 
ATCCGTACGC TAGACGTAAC GGGCGAGATC ACACTCCCAG 
GATGGTTATG CCTGGTGATA ACGTCACCAT CGATGTCAAG 
CAGTAGCTTG TAGCGTAGGT CTACGCTTCG C 



CCTCAGACGC 50 

TCTAGTTGTC 100 

TCGAGCTCGT 150 

GACGGCGACA 200 

TGGTGAGCCT 250 

ACACTTGGAT 3 00 

CCTGTAGAGG 350 

TCGTATCGAG 400 

GTCTAGGTGC 450 

CGCAAGATCC 500 

CCGTGGTATC 5 50 

CAGGTCAGGT 600 

AAG AAG G AAG 650 

TC AGTTCTAC 7 00 

AGGGTGTAGA 750 

CTCATCTCTC 800 
831 



2 ) INFORMATION FOR SEQ ID NO: 141 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 818 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: ATCC 33277 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 

CGGTGCTATA ATC GTTGTAG CAGCTACAGA CGGTCCTATG CCTCAGACTC 50 

GCGAGCACAT CCTTTTGGCT CGCCAGGTAA ACGTTCCTCG TCTGGTTGTT 100 

TTCATGAACA AATGTGACAT GGT AG AC GAT GAAGAGATGC TCGAGCTTGT 15 0 

15 TGAAATGGAC ATGCGCGAAC TC CTTTCTTT CTACGATTTC GATGGTGACA 200 

ATACCCCTAT CATCCGTGGT TCTGCTCTGG GCGCTTTGAA TGGAGAGCCT 25 0 

CAGTGGGAAG ACAAGGTGAT GGAGCTTATG GAAGCTGTTG ACAACTGGGT 3 00 

TCCCCTGCCT GAGCGCGATA TCGACAAACC GTTCTTGATG CCGGTTGAAG 3 50 

ACGTGTTCTC TATCACGGGT CGTGGTACGG TCGCTACAGG ACGTATCGAA 400 

20 ACCGGTATTG TGAAGACCGG TGACGAAGTT CAAATCATCG GCCTCGGTGC 450 

AGAAGGAATG AAGTC GGTTG TTACGGGTGT TGAAATGTTC CGTAAGATTC 500 

TTGACGAAGG TCAGGCTGGT GACAACGTTG GTCTCCTCCT GCGTGGTATC 550 

GATAAGGATC AGATCAAGCG TGGTATGGTT ATCTCTCACC CGGGTAAGAT 600 

TACTCCTCAC AAGAGATTTA AGGCCGAGGT TTATATCTTG AAGAAAGAAG 650 

2 5 AAGGTGGTCG CCACACTCCT TTCCACAACA AATATCGTCC GCAGTTCTAC 700 

ATCCGTACGC TTGACGTGAC CGGTGAAATC ACTCTTCCCG AAGGAACAGA 7 50 

AATGGTTATG CCCGGTGACA ACGTAACGAT CACTGTAGAA CTCATCTACC 800 

CGGTTGCATG TAATGTAG 818 



30 



45 



2 ) INFORMATION FOR SEQ ID NO: 142 



( i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontlum 

(B) STRAIN: ATCC 49100 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 



CGGCGCTATT CTGGTTGTTG CTGCAACTGA TGGTCCTATG CCTCAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAGGTTG GCGTACCATA CATCATTGTG 100 

50 TTCCTGAACA AGTGTGACAT GGTTGAYGAT GAAGAGCTGT TAG AAC TGGT 150 

TGAAATGGAA GTTCGTGAGC TTCTGTCTCA GTACGATTTC CCAGGTGATG 2 00 

ATACTCCAGT TGTTCGTGGT TCTGCGCTGA AAGCGTTRGA AGGCGAAGCT 250 

GAGTGGGAAG CTAAAATCAT TGAATTGGCT GACTCCCTGG AYAGCTACAT 300 

TCCACAGCCA GAGCGTGCAA TTGATAAGCC GTTCCTGCTG CCAATCGAAG 350 

55 ACGTTTTCTC AATCTCTGGC CGTGGTACAG TAGTAACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

TACTGTGAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AARTT AC TGG 500 

ATGAAGGCCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAG 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCAAAACCAG GTTCAATCAA 600 

60 CCCGCATACT AACTTCGTAT CAGAAGTTTA TATCCTGAGC AAAGATGAAG 650 
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GTGGTCGTCA TACTCCATTC TTCAAAGGCT 
CGTACAACTG ACGTGACCGG TACCATCGAA 
GGTAATGCCA GGTGATAACA TTCAGATGAC 
TCGCGATGGA CGAAGGTTTA CGCTTCGCTA 



ACCGTCCACA GTTYTACTTC 7 00 

CTGCCAGAAG GCGTAGAGAT 7 50 

TGTAACTCTG ATTGCCCCAA 800 

830 



2 ) INFORMATION FOR SEQ ID NO: 143 

10 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella melaninogenica 
20 (B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

TGGTGCTATC TTGGTTG TAG CTGCTACTGA TGGTCCTATG CCTCAGACTC 5 0 

25 GTGAGCACGT ATTGCTCGCT CGTCAGGTAA ACGTACCTCG CTTGGTTGTA 100 

TTCTTGAACA AGTGTGATAT GGTTGACGAT GCTGAGATGC TTGACCTCGT 15 0 

TGAGATGGAG GTTCGTGAGA TCCTCGAGCA GTACGGTTAT GAGGAGGATA 2 00 

CTCCTATTAT TCGTGGTTCT GCACTCGGTG CTTTGAACGG TGTTGAGAAG 2 50 

TGGGTAG AC T CTGTAATGGA GCTCATGGAT ACTGTTGACA CTTGGATTGA 3 00 

3 0 AGAGCCAGAG CGTGAGATTG ACAAGCCATT CTTGATGCCT GTTGAGGACG 3 50 

TATTCTCTAT CACAGGTCGT GGTACTGTAG CTACTGGTCG TATCGAGACT 4 00 

GGTATCTGTA AGGTAGGTGA TGAGGTTCAG TTGCTCGGTC TCGGTGAGGA 450 

CAAGAAGTCT GTTATCACTG GTGTTGAGAT GTTCCGTAAG AACCTTCCAA 5 00 

CAGGTCAGGC TGGTGACAAC GTAGGTCTCC TCCTTCGTGG TATC GATAAG 550 

3 5 GCTGAGGTTA AGCGTGGTAT GGTTGTTGTG CACCCAGGTG CTATTACTCC 6 00 

TCACGATCAC TTCAAGGCAT CTATCTATGT ATTGAAGAAG GAAGAGGGTG 650 

GTCGTCATAC TCCATTCGGT AACAAGTATC GTCCACAGTT CTACCTCCGT 7 00 

ACAATGGACT GTACAGGTGA AATCCACCTC CCAGAGGGCG TTGAGATGGT 7 50 

TATGCCAGGT GACAACGTAG AGATTGAAGT TGTATTGATC TATAAGGTTG 800 

40 C TTTGAACG A GGGTCTTCGT T 821 



45 



2) INFORMATION FOR SEQ ID NO: 144 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Prevotella oralis 

(B) STRAIN: ATCC 33269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 

60 TGGTGCTATT CTTGTAGTAG CTGCTACTGA CGGTCCTATG CCTCAAACTC 50 
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GTGAACACGT 
TTCTTGAACA 
AGAAATGGAG 
CTCCTATTGT 
5 TGGGTTGACA 
GGAACCACCG 
TATTTTCTAT 
GGTAAGGTTA 
TAAGAAGTCC 

10 AAGGTGAAGC 
ACGGAAGTAA 
TCACGATCAT 
GTCGCCATAC 
ACCATGGACT 

15 AATGCCGGGT 
CTTTGAACGA 



GCTTCTTGCT 
AGTGCGATAT 
CTTCATGAAC 
TCGTGGTTCG 
GCGTGATGAA 
CGTGATCTTG 
TACTGGTCGT 
AGGTGGGCGA 
GTTGTGACAG 
TGGTGATAAT 
AGCGTGGTAT 
TTCAAGGCTT 
TCCGTTTGGT 
GTACTGGTGA 
GATAACGTCG 
GGGACTTCGT 



CGTCAGGTGA 
GGTTGACGAT 
TTCTCGAGCA 
GCACTTGGCG 
GTTGATGGAT 
ATAAGCCTTT 
GGAACGGTTG 
TGAAGTTCAA 
GCGTTGAGAT 
GTAGGCTTGC 
GGTTGTCGTA 
CAGTTTACGT 
AMCAAGTATC 
AATTACTCTT 
AAATTGAAGT 
TTCGCTA 



ACGTACCTCG 
GAAGAAATGC 
GTATGAATAT 
CTCTGAATGG 
ACCGTTGATG 
CTTGATGCCG 
TTACAGGCCG 
CTTCTTGGTC 
GTTCCGTAAG 
TGCTTCGTGG 
CATCCGGGGG 
ATTGAAGAAA 
GTCCACAGTT 
CCGGAAGGAG 
TAAGTTGATC 



TTTGGTCGTT 100 

TTGAGCTCGT 150 

GAGGAGGATA 2 00 

AGTAGAGAAG 2 50 

AATGGATACA 3 00 

GTAGAGGATG 3 50 

TATTGAAACT 400 

TCGGTGAAGA 450 

ATTCTTGACG 500 

TATCGATAAG 550 

CTATTACTCC 600 

GAAGAAGGCG 650 

CTATCTTCGT 700 

TTGAGATGGT 7 50 

TATC CGGTAG 800 
827 



20 2 ) INFORMATION FOR SEQ ID NO: 145 



25 



30 



35 



40 



45 



50 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prop ioni bacterium acnes 

(B) STRAIN: ATCC 6919 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 145 



CGGCGCCATC CTCGTGGTTG CTGCTACCGA CGGCCCGATG 
GCGAGCACGT TCTGCTCGCT CGTCAGGTGG GCGTGCCCGC 
GCCCTCAACA AGTGCGACAT GGTTGACGAT GAGGAGCTCA 
CGAGATGGAG GTCCGCGAGC TGCTGACCTC GCAGGAGTTC 
ACTGCCCTGT CGTTCGCATC TCCGCCTTCC AGGCCCTCCA 
AAGTGGACCC AGTCGATCCT CGACCTCATG GACGCCGTGG 
CCCGCAGCCT GAGCGCGATC TCGACAAGCC CTTCCTTATG 
AC GTC TTC AC CATCACCGGC CGTGGCACCG TTGTCACCGG 
CGCGGCGTCG TCAAGACTGG CGAAGAGGTC GAGATCGTCG 
GAAGACCCAG AAGACCACCG TTACCGGTGT CGAGATGTTC 
TCGACGAGGG CCGCGCTGGT GAGAACGTCG GCGTTCTGCT 
AAGAAGGAGG ATGTCGTTCG CGGCATGGTC CTCTCCAAGC 
CACCCCCCAC ACCGACTTCG AGGGCCAGGT CTACGTCCTC 
AGGGTGGCCG CCACAAGCCG TTCTTCTCCC ACTACAGCCC 
TTCCGTACCA CGGACGTGAC TGGCACTGTT GAGCTCCCCG 
GATGGTCATG CCTGGCGACA ACACCGACAT GACTGTGCAC 
CGGTTGCCAT GGAGGATCAG CTCAAGTTCG CTA 



CCTCAGACTC 50 

CATCGTCGTC 100 

TTGAGCTCGT 150 

GACGGCGACA 2 00 

GGGTGATGAG 2 50 

AC G AGT AC AT 3 00 

CCGATCGAGG 350 

TCGTGTCGAG 400 

GTATCCACGA 45 0 

CGCAAGATCC 5 00 

CCGTGGCACC 55 0 

CTGGTTCCAC 600 

AAGAAGGATG 65 0 

CCAGTTCTAC 7 00 

AGGGCACCGA 750 

CTGATTCACC 800 
833 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mlrabilis 

(B) STRAIN: ATCC 35659 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 

CACAAACTCG TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC 50 

ATCATCGTAT TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGCTGTT 100 

AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAA TACGATTTCC 15 0 

15 CAGGTGATGA CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCACTGGAA 2 00 

GGCGAAGCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA 2 50 

TTCTTATATC CCAGAGCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC 3 00 

CAATCGAAGA TGTATTCTCA ATCTCAGGCC GTGGTACAGT AGTTACTGGT 3 50 

CGTGTAGAGC GTGGTATCAT CAAAGTAGGT GATGAAGTTG AGATTGTTGG 400 

2 0 TATCAAAGAA ACCGCCAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 450 

AATTACTTGA eGAASGTCGT GCAGGTGAGA ACGTAGGTGT TCTGCTGCGT 5 00 

GGTACAAAAC GTGAAGAAAT CGAACGTGGA CAAGTACTGG CRAAACCAGG 55 0 

CTCAATCAAC CCACACAACA AATTTGAATC AGAAGTTTAT ATTCTGAGCA 600 

AAGATGAAGG TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650 

2 5 TTCTACTTCC GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG 700 

CGTAGAAATG GTAATGCCAG GCGACAACGT GAACATGATC GTTGA 745 



3 0 2 ) INFORMATION FOR SEQ ID NO: 147 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus pennerl 

(B) STRAIN: ATCC 33519 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 



GGAGCTATCC TGGTTGTTGC TGCGACAGAT GGCCCAATGC CACAAACTCG 50 

TGAGCACATC CTGTTAGGTC GTCAGGTTGG TGTTCCTTAC ATCATCGTAT 100 

TCCTGAACAA ATGTGACATG GTAGATGATG AAGAGTTACT GGAATTAGTM 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CAGGTGATGA 200 

50 CACTCCAGTA ATCCGTGGTT CAGCGCTGAA AGCACTGGAA GGCGAAGCAG 250 

AGTGGGAAGC AAAAATTGTT GAATTAGCAG AAGCACTGGA TTCATACATC 300 

CCAGARCCAG AGCGTGCAAT TGACAAACCA TTCCTGTTAC CAATTGAAGA 350 

CGTATTCTCA ATTTCAGGCC GTGGTACAGT AGTAACAGGT CGTGTTGAGC 400 

GTGGCGTAAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATTAAACCA 450 

55 AC AG C G AAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA AATTACTTGA 500 

CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT TCTTCTGCGT GGT AC TAAAC 550 

GTGAAGAAAT CGAACGTGGA CAAGTACTGG CGAAACCAGG TTCAATCAAC 600 

CCACACACTA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

60 GTACAACTGA CGTAACTGGT ACTATCGAAT TACCAGAAGG CGTAGAAATG 750 
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GTAATG C C AG GTGACAACAT CAACATGATC GTTGAACTGA TTCACCGAAT 800 
CGCGATGGAC GACGGTTTAC GTTTCGCTA 82 9 



2 ) INFORMATION FOR SEQ ID NO : 148 

{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 4 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 

CGGAGCTATT CTGGTTGTTG CTGCG AC TG A TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGCCAGGTTG GTGTACCTTA CATCATCGTA 100 

TTCCTGAACA AATGTGACAT GGTTGATGAT GAAGAACTGC TGGAATTAGT 150 

2 5 AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGTGATG 200 

ACACTCCAGT AATCCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 25 0 

GAGTGGGAAG CAAAAATTGT TGAATTAGCA GAAGCACTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCTATCGAAG 3 50 

ACGTATTCTC AATCTCTGGT C GTGGT AC AG TAGTAACAGG CCGTGTAGAG 400 

3 0 CGTGGTGTTG TTAAAGTTGG TGAAGAAGTT GAGATTGTTG GTATTAAAGA 450 

CACAGTTAAA AC AAC TTGT A CTGGCGTTGA AATGTTCCGT AAATTACTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG TTCTTCTGCG TGGTAC TAAA 55 0 

CGTGAAGAAA TCGAACGTGG ACAAGTACTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACT AAATTCGAAT CAGAAGTTTA TATCCTGAGC AAAGATGAAG 650 

35 GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTAACTGG TACTATCGAA TTACCAGAAG GCGTAGAAAT 7 50 

GGTAATGCCA GGTGACAACA TCAACATGAT CGTTGAACTG ATTCACCCTA 800 

TCGCGTAGGA CGACGGTTTA CGTT 82 4 



40 



55 



2 ) INFORMATION FOR SEQ ID NO: 149 



( i ) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencla alcalifaciens 

(B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 



CACAAACTCG TGAGCACATC CTGTTAGGTC GCCAAGTAGG TGTTCCTTAC 50 
ATCATCGTTT TCCTGAACAA ATGTGACATG GTAGACGACG AAGAACTGTT 10 0 

60 AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC 15 0 
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CAGGCGATGA CACTCCAGTT GTTCGCGGTT 
GGCAACCCAG AGTGGGAAGC AAAAATTGTT 
TTCTTACATC CCAGAACCAG AGCGTGCAAT 
CAATCGAAGA CGTATTCTCA ATCTCTGGTC 
5 CGTGTTGAGC GTGGTATCAT CAAAGTTGGT 
TATTCAAGCG ACTGCGAAAA CAACTTGTAC 
AACTGCTGGA TGAAGGTCGT GCGGGTGAGA 
GGTACTAAAC GTGAAGAAAT TCAACGTGGT 
TTCAATCAAG CCACACACTC AATTCGAATC 
10 AAGATGAAGG TGGTCGTCAT ACTCCATTCT 
TTCTACTTCC GTACAACTGA CGTAACCGGT 
CGTAGAGATG GTAATGCCAG GCGACAACAT 



CAGCACTGAA AGCGCTGGAA 2 00 

GAATTAGCAG GTTACCTGGA 2 50 

TGACAAGCCA TTCCTGCTGC 3 00 

GTGGTACAGT AGTAAC AGGC 3 50 

GAAGAAGTTG AAATCGTTGG 400 

TGGCGTTGAA ATGTTC CGT A 45 0 

ACGTTGGTGT TCTGCTGCGT 500 

CAAGTACTGG CTAAACCAGG 55 0 

AGAAGTATAT ATTCTGAGCA 600 

TCAAAGGCTA CCGTCCACAG 650 

ACTATCGAAC TGCCAGAAGG 7 00 

CAACATGATC GTGAC 7 45 



15 



2) INFORMATION FOR SEQ ID NO: 150 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 830 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAAGTAG GTGTWCCTTA CATCATCGTT 100 

TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAACTGT TAGAATTAGT 150 

35 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATAC GATTTC CCAGGCGACG 200 

ACACTCCAGT TGTCCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCAACCCA 250 

GAGTGGGAAG CGAAAATTGT TGAATTAGCA GGTCACTTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCAATCGAAG 3 50 

. ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTTGAG 400 

40 CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

CACGGTTAAA AC AAC TTGT A CTGGCGTTGA AATGTTC CGT AAACTGCTGG 500 

ACGAAGGTCG TGCGGGTGAG AAC G TTGGTG TTCTGCTGCG TGGTACTAAA 55 0 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCAAAACCAG GTTCAATCAA 600 

GCCACACACT AAATTCGAAT CAGAAGTCTA TATTCTGAGC AAAGATGAAG 65 0 

45 GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 7 50 

GGTAATGCCA GGTGATAACA TCAACATGAT CGTTACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGTTTA CGTTTCGCAA 83 0 

50 

2 ) INFORMATION FOR SEQ ID NO: 151 

( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Provldencla rustiglanii 

(B) STRAIN : ATCC 33673 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGT CGCCAAGTAG GTGTTCCTTA CATCATCGTT 100 

10 TTCCTGAACA AATGTGACAT GGTTGACGAC GAAGAACTGT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCAGGCGACG 2 00 

ACACTCCAGT TGTTCGYGGT TCAGCACTGA AAGCGC TGG A AGGTATCCCT 2 50 

GAGTGGGAAG CGAAAATTGT TGAATTAGCC GGTTACCTGG ACAGCTACAT 3 00 

CCCAGAACCA GAGCGCGCAA TTGACCGTCC ATTCCTGCTG CCAATCGAAG 350 

15 ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG SCGTGTTGAG 400 

CGTGGTATCG TTAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 45 0 

CACRGTTAAA ACAACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTTG 500 

ACGAAGGTCG TGC TGGTG AG AACGTTGGTG TTTTACTGCG TGGTACTAAG 550 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCTAAACCAG GTTCAATCAA 600 

20 GCCACACACT ACTTTTGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTC TACTTC 7 00 

CGTACAACTG ACGTAAC CGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGC C A GGCGACAACA TCAACATGAT CGTGACACTG ATTCACCCAA 800 

TCGCGATGGA TGATGGTTTA CGTTTC 82 6 

25 



2 ) INFORMATION FOR SEQ ID NO: 152 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia. stuartil 
40 (B) STRAIN: ATCC 33672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 

CGGTGCAATC CTAGTTGTTG CGGCAACAGA TGGCCCAATG CCACAAACTC 50 

45 GTGAGCACAT CCTGTTAGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTG 100 

TTCCTGAACA AATGTGACAT GGTAG AC G AC GAAGAGCTGC TGGAACTGGT 15 0 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGTGATG 2 00 

ACACTCCAGT TATCCGTGGT TCAGCGCTGA AAGCGTTGGA AGGCAACCCA 2 50 

GAGTGGGAAG CGAAAATCGT TGAACTAGCA GAAGCACTGG ACAGCTACAT 300 

50 CCCAGAGCCA GAGCGTGCAA TTGACAAGCC ATTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC AATCTCAGGT CGTGGTACAG TAGTCACAGG CCGTGTTGAG 400 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTAG GTATCAAAGA 450 

GACTGCGAAA ACCACTTGTA CTGGCGTTGA AATGTTCCGT AAACTGCTGG 500 

ACGAAGGCCG TGC GGGTGAG AACGTAGGTG TTCTGCTGCG TGGTACTAAG 550 

55 CGTGAAGAAA TCGAACGTGG TCAAGTTCTG GCGAAACCAG GTTCAATCAA 600 

GCCACACACA ACTTTCGAAT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACGCCATTC TTCAAAGGYT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTAACAGG TACTATCGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGCGACAACG TGAACATGAA AGTAACTCTG ATTCACCCAA 800 

60 TCGCGATGGA CGATGGTTTG CGTTTCGCAA 83 0 

295 



CA 02307010 2000-05-19 



2) INFORMATION FOR SEQ ID NO: 153 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: ATCC 35554 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 

2 0 CCTGGTTTGC TCGGCTGCCG ACGGCCCCAT GCCGCAGACC CGCGAGCACA 50 

TCCTGCTGTC CCGCCAGGTA GGCGTTCGCT ACATGGTCGT STTCGTGAAC 1G0 

AAAGCCGACA TGGTCGACGA CGCCGAGCTG CTGGAACTGG TCGAGATGGA 150 

AGTTCGCGAT C TGC TGAAC A CCTACGACTT CCCGGGCGAC GACACTCCGA 200 

TCATCATCGG TTCCGCGCTG ATGGCGCTGG AAGGCAAGGA TGACAACGGC 250 

25 ATCGGCGTAA GCGCCGTGCA GAAGCTGGTA GAGACCCTGG ACTCCTACAT 300 

TCCGGAGCCG GTTCGTGCCA TCGACCAGCC GTTCCTCATG CCGATCGAAG 350 

ACGTGTTCTC GATCTCCGGT CGCGGTACCG TGGTAACCGG TCGTGTAGAG 400 

CGCGGCATCA TCAAGGTCCA GGAAGAAGTG GAAATCGTCG GCATCAAGGC 4 50 

GACCACCAAG ACCACCTGCA CCGGCGTTGA AATGTTCCGC AAGCTGCTCG 5 00 

30 ACGAAGGTCG TGCTGGTGAG AACGTTGGTA TCCTGCTGCG TGGCACCAAG 550 

CGTGAAGACG TAGAGCGTGG CCAGGTACTG GCCAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAGT GCGAAGTGTA CGTGCTGTCC AAGGAAGAAG 650 

GTGGTCGTCA CACCCCGTTC TTCAAGGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACCACKG ACGTGACCGG TAMCTGCGAG CTGCCGGAAG GCGTAGAGAT 7 50 

35 GGTAATGCCG GGCGACAACA TCAAGATGGT TGTCACCCTG ATCGCTCCGA 800 

TCGCCATGGA AG ATGGC TGC GTTCGCG 827 



40 2 ) INFORMATION FOR SEQ ID NO: 154 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas fluorescens 

(B) STRAIN: ATCC 13525 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 



CGGCGCAATC CTGGTTTGCT CGGCCGCTGA TGGTCCGATG CCACAAACCC 50 

GTGAACACAT CCTGCTGTCC CGTCAGGTTG GCGTTCCGTA CATCGTGGTT 100 

TACCTGAACA AGGCTGACCT GGTAGACGAC GCTGAGCTGC TGGAACTGGT 150 

TGAGATGGAA GTGCGCGATC TGCTGAGCAC TTACGACTTC CCAGGCGACG 2 00 

60 ACACTCCGAT CATCATCGGT TCTGCTCGTA TGGCTCTGGA AGGCAAAGAC 2 50 
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GACAACGAAA TGGGCACCAC GTCCGTTCGT AAACTGGTTG AAACTCTGGA 3 00 

CAGCTACATC CCAGATCCAG TTCGTGTTAT CGACAAGCCG TTCCTGATGC 3 50 

CAATCGAAGA CGTGTTCTCG ATCTCCGGTC GCGGTACTGT TGTGACTGGT 400 

CGTATCGAGC GCGGTATCGT TAAGGTTCAA GATCCACTGG AAATCGTTGG 4 50 

5 TCTGCGTGAC ACTACCGTCA CCACCTGCAC CGGTGTTGAA ATGTTCCGTA 500 

AGCTGCTCGA CGAAGGTCGT GCTGGCGAGA ACTGCGGCGT TCTGCTGCGT 550. 

GGTACCAAGC GTGACGACGT TGAGCGTGGC CAGGTTCTGG TTAAGCCAGG 600 

TTCGGTTAAG CCGCACACCA AGTTCGAAGC TGAAGTCTAC GTACTGAGCA 650 

AAGAAGAAGG CGGTCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCACAG 700 

10 TTCTACTTCC GTACTACTGA CGTGACTGGT AACTGCGAGC TGCCGGAAGG 750 

CGTTGAAATG GTTATGCCAG GCGACAACAT CAAAATGGTT GTTACCCTGA 800 

TCAAAACCAT CGCAATGGAA GACGGTCTGC GTTTCGCTAT T .841 



15 2 ) INFORMATION FOR SEQ ID NO: 155 



20 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



30 



35 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomorxas stutzerl 

(B) STRAIN: ATCC 17588 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 

CGGCGCGATC CTGGTCTGCT CGGCTGCTGA CGGCCCCATG 
GCGAGCACAT CCTGCTGTCC CGTCAGGTTG GTGTTCCGTA 
TTCCTGAACA AGGCCGACAT GGTTGATGAC GCCGAGCTGC 
CGAGATGGAA GTTCGYGACC TGCTGTCGAC CTACGACTTC 
ACACTCCGAT CATCATCGGC TCCGCGCTGA TGGCGCTGAA 
GACAACGAGC TCGGCACCAC TGCGGTGAAG AAGCTGGTCG 



CAGCTACATT CCCGAGCCGG 
CGATCGAAGA CGTGTTCTCG 
CGCGTAGAGC GCGGCATCGT 
TCTGCGTCCG ACCACCAAGA 
AGCTGCTCGA YGARGGTCGT 
GGCACCAAGC GTGACGAAGT 
CACCATCAAG CCGCACACCA AGTTCGAAGC 
AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT 



TTC GTGC CAT CGACAAGCCG 
ATCTCCGGTC GCGGCACSGT 
CAAGGTTCAG GAAGAGATCG 
CTACCTGCAC CGGCGTTGAG 
GCTGGCGAGA ACTGCGGYGT 
GGAGCGTGGT CAGGTTCTGG 
CGAAGTGTAC 
TYAAGGGCTA 



TTCTACTTCC GTACCACTGA YGTGAC YGGW TCGTGCGARC 
CGTCGAGATG GTAATGCCGG GCGACAACGT GAAGATGGTT 
TCAAGCCGAT CGCCATGGAA GACGGCCTGC GCTTCGCGAT 



CCGCAGACTC 50 

CATCGTCGTG 100 

TCGAGCTGGT 150 

CCGGGTGAYG 200 

CGGCGAAGAC 250 

AGACCCTGGA 3 00 

TTCCTGATGC 350 

GGTAACCGGT 400 

AGATCGTCGG 45 0 

ATGTTCCGCA 5 00 

GCTGCTGCGT 550 

CCAAGCCGGG 600 

GTGCTGTC C A 65 0 

CCGTCCKCAG 7 00 

TGCCGGAAGG 75 0 
GTCACCCTGA . 800 

T 841 



50 

2 ) INFORMATION FOR SEQ ID NO : 156 

{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 
.(B) STRAIN: ATCC 23333 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 

GC TATTC TAG TAGTATCAGC AACTGACGGT CCAATGCCAC AAACACGTGA 50 

GCACATTCTA TTATCACGTC AGGTTGGTGT ACCATACATC ATCGTATTCA 100 

TGAACAAATG TGACATGGTA GATGACGAAG AGTTACTAGA GCTAGTAGAA 150 

10 ATGGAAGTGC GTGAATTACT TTCAGACTAC GACTTCCCAG GTGATGACAC 2 00 

TCCAATCATC AAAGGTTCAG CTTTAGAAGC GTTAAATGGT AACGACGGTA 250 

AGTACGGTGA GCCAGCAGTT ATCGAACTAC TAAACACTCT AGACACTTAC 3 00 

ATTCCAGAGC CAGAGCGTGA CATC G AT AAG CCATTCCTAA TGCCAATCGA 350 

AGACGTATTC TCAATCTCAG GTCGTGGTAC AG T AGTAAC A GGCCGTGTTG 400 

15 AATCTGGTAT CATCAAAGTT GGTGACGAAA TCGAAATCGT TGGTATCAAA 450 

G AC AC AGTTA AAACAACTTG TACTGGTATC GAGATGTTCC GTAAGTTACT 5 00 

AGACGAAGGT CGTGCTGGTG AG AAC TGTGG TGTACTATTA CGTGGTACTA 550 

AGCGTGAAGA CGTACAACGT GGTCAAGTAC TTGCTAAGCC AGGTTCAATC 600 

ACTCCACACA CCAACTTCGA CGCAGAAGTA T ACGT AC TAT CAAAAGAAGA 650 

20 AGGTGGTCGT CACACTCCAT TCTTAAATGG TTACCGTCCA CAGTTCTACT 7 00 

TCCGTACTAC TGACGTAACA GG'TGCAATCA CGTTACAAGA AGGTACTGAA 750 

ATGGTAATGC CAGGCGATAA CGTTGAGATG AGCGTAGAGC TAATCCACCC 800 

AATCGCTAGG ACAAAGGTTT ACGTTTCGCA ATC 833 



25 



40 



60 



2 ) INFORMATION FOR SEQ ID NO: 157 



( i ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatllls 

(B) STRAIN: ATCC 33071 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 157 



GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATAC ATGATCGTGT 100 

45 TCATGAACAA ATGCGACATG GTAGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT TCTGTCTGCT TACGAATTCC CAGGCGACGA 200 

CATCCCGGTC ATCAAAGGTT CAGCGCTGAA AGCACTGGAA GGCGATGCTA 250 

CTTGGGAAGC GAAAATCATC GAACTGGCAG AAGCACTGGA CAGCTACATT 3 00 

CC ATTGCC AG AGCGTGCTAT CGATAAGCCA TTCCTGCTGC CAATCGAAGA 3 50 

50 CGTATTCTCC ATCTCCGGTC GTGGTACAGT GGTTACCGGT CGTGTAGAGC 400 

GCGGTATCGT TAAAGTGGGC GAAGAAGTTG AAATCGTCGG TATCAAGGAC 450 

ACTGTTAAGT C T AC TTGT AC TGGCGTTGAA ATGTTC CGC A AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

55 CCACACACCA AG TTTG ATTC CGAAGTGTAC ATCCTGAGCA AAGATGAAGG 6 50 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTTATGCCTG GTGACAACGT GAACATGGTT GTTACCCTGA TCCACCCAAT 800 

CGC G ATGGAT GACGGTCTGC GTTTC 82 5 
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10 



2) INFORMATION FOR SEQ ID NO: 158 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuls subsp. arizonae 
15 (B) STRAIN : ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

2 0 GTGAGCACAT CCTGCTGGGC CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 15 0 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACG AC TTC CCGGGCGACG 2 00 

ATACGC CG AT CGTTCGTGGT TCTGCTC TG A AAGCGCTGGA AGGCGACGCA 2 50 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 3 00 

25 CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

3 0 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTAC T AC TG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

35 TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 



40 



55 



2 ) INFORMATION FOR SEQ ID NO: 159 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Salmonella choleraesuls subsp, choleraesuls 

serotype Choleraesuis 
(B) STRAIN: ATCC 7001 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 159 



CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

60 ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGTGACGCA 250 

299 



CA 02307010 2000-05-19 



GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCA TCAAAGTGGG YGAAGAAGTT GAAATCGTTG GTATCAAAGA 45 0 

5 GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAAC TGTTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 5 50 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GC AC CATC AA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

10 CGTACTACTG ACGTGACTGG C AC CATC G AA CTGCCGGAAG GCGTAGAGAT 7 50 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



15 



25 



50 



2 ) INFORMATION FOR SEQ ID NO : 160 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 807 bases 
20 (B) TYPE: Nucleic acid 

( C ) STEANDEDNES S : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuls subsp. diarizonae 

(B) STRAIN: ATCC 43973 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATTATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

35 AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCAA 200 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 2 50 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 3 50 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

40 ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 45 0 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 5 00 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCAC CATC A AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ATATTC TGTC CAAAGATGAA GGCGGCCGTC 650 

45 ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 7 00 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 807 



2 ) INFORMATION FOR SEQ ID NO: 161 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Heidelberg 
5 (B) STRAIN: ATCC 8326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

10 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGC G AC AT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ACACTCCGAT CGTTCGTGGT TC TGCTCTG A AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

15 CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AAC GTAGGTG TTCTGCTGCG TGGTATCAAA 5 50 

2 0 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA YACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG CACCATCGAA TTGCCGGAAG GCGTAGAGAT 75 0 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 8 00 

25 TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



30 



2 ) INFORMATION FOR SEQ ID NO : 162 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 

45 CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCC TGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACGCCGA 200 

TCGTGCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAATGGGAA 250 

50 GC GAAAATC A TCGAACTGGC TGGCTACCTG GATTCTTACA TCCCGGAACC 300 

AGAGCGTGCG ATTGACAAAC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CTATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCGACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

55 GTGCTGGCGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGC AC CATC A AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGCCGTC 65 0 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AATTCTACTT CCGTACGACT 700 

GACGTGACTG GCACCATCGA ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 

60 GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 
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ACGACGG 807 



5 2 ) INFORMATION FOR SEQ ID NO: 163 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp . indica 

(B) STRAIN: ATCC 43976 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 



20 



CGCGATCCTG QTTGxTGCTG CGACTSAOSS CCCGATGCCG CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AAC TGGTTG A 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 2 00 

25 CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGG AAGC G A AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 3 00 

GG AAC C AG AG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 3 50 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 400 

GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 450 

30 TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 

GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 600 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GATGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGC T AC C GTCCGCAGTT CTACTTCCGT 7 00 

35 ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 7 50 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC C ATCC GAT CG 800 

CRATGGACGA CGGTCTGCGT TTCGCAA 827 

40 

2 ) INFORMATION FOR SEQ ID NO: 164 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 807 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 164 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

60 AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCGA 2 00 
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TCGTGCGTGG TTCCGCTCTG 
GMGAAAATCA TCGAACTGGC 
AGAGCGTGCG ATTGACAAGC 
CCATCTCCGG TCGTGGTACC 
ATCAAAGTGG GCGAAGAAGT 
GTCTACCTGT ACTGGCGTTG 
GTGCCGGTGA GAACGTAGGT 
ATCGAAC GTG GTCAGGTACT 
CAAGTTCGAA TCTGAAGTGT 
ATACTCCGTT CTTCAAAGGC 
GACGTGACTG GCACCATCGA 
GGGCGACAAC ATCAAAATGG 
ACGACGG 



AAAGCGCTGG AAGGCGAMGC 
TGGCTWCCTG GATTCTTACA 
CGTTCCTGCT GCCGATCGAA 
GTTGTTACCG GTCGTGTAGA 
TGAAATCGTT GGTATCAAAG 
AAATGTTCCG CAAACTGCTG 
GTTCTGCTGC GTGGTATCAA 
GGCTAAGCCG GGCACCATCA 
ACATTCTGTC CAAAGATGAA 
TACCGTCCGC AGTTC TACTT 
ACTGCCGGAA GGCGTGGAGA 
TTGTTACCCT GATCCACCCG 



TGAGTGGGAA 


250 


TTCCGGAACC 


300 


GACGTATTCT 


350 


GCGCGGTATC 


400 


AGACTCAGAA 


450 


GACGAAGGCC 


500 


ACGTGAAGAA 


550 


AGCCGCACAC 


600 


GGCGGCCGTC 


650 


CCGTACCACT 


700 


TGGTAATGCC 


750 


ATCGCGATGG 


800 




807 



2 ) INFORMATION FOR SEQ ID NO : 165 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: ^ , 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Typhi 

(B) STRAIN: ATCC 10749 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 

CGGCGCGATC C TGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 2 50 

GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

AC G AAGGC CG TGCNGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACY AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACT AC TG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 7 50 

GGTAATGC CG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 166 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Serratia fonticola 

(B) STRAIN: DSM 4576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 166 

10 CGGCGCTATC CTGGTTGTAG CTGCGACTGA CGGCCCTATG CCTCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTA CATCATCGTA 10 0 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 15 0 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTATGACTTC CCTGGTGATG 2 00 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAAGCT 25 0 

15 GAGTGGGAAG CTAAAATCAT CGAGCTGGCC GGTCACCTGG ATTCCTACAT 3 00 

CCCAGAACCA GAGCGTGCTA TCGATCAGCC GTTCCTGCTG CCAATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TAGTTACCGG TCGTGTTGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 45 0 

CACCGTTAAG TCTAC C TGT A CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

20 ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAG 55 0 

CHTGAAGACA TCGAACGTGG TCAGGTACTG GCTAAACCAG GTTCCATCAA 600 

GCCGCACACT CAGTTCGATT CAGAAGTGTA TATCCTGAGC AAAGAAGAAG 65 0 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 750 

25 GGTAATGCCA GGCGATAACG TGAACATGGT TGTTACCCTG ATCCACCCAA 800 

TCGCTATGGA CCAAGGC 817 



3 0 2 ) INFORMATION FOR SEQ ID NO: 167 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 



GCTGCGACTG ACGGCCCAAT GCCTCAGACC CGTGAGCACA TCCTGCTGGG 50 

TCGTCAGGTT GGCGTTCCTT TCATCATCGT ATTCATGAAC AAATGCGACA 100 

TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAAATGGA AGTTCGTGAA 15 0 

CTTCTGTCTG CTTACGACTT CCCTGGTGAT GACCTGCCGG TTGTTCGTGG 2 00 

50 TTCAGCGCTG AAAGCACTGG AAGGCGAAGC TGAGTGGGAA GCTAAAATCA 2 50 

TCGAGCTGGC CGGTTACCTG GATTCTTACA TCCCAGAACC AGAGCGTGCT 3 00 

ATCGACAAGC CGTTCCTGCT GCCAATCGAA GACGTCTTCT CCATCTCCGG 3 50 

TCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGTATC GTTAAAGTTG 400 

GCGAAGAAGT TGAAATCGTT GGTATCAAAG ACACCGTTAA GTCTACCTGT 450 

55 AC TGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC GTGCTGGTGA 500 

GAACGTTGGT GTTCTGC TGC GTGGTATCAA GCGTGAAGAC ATCGAACGTG 550 

GTCAGGTACT GGCTAAACCA GGTTCAATCA AGCCACACAC CAAGTTCGAC 600 

TCAGAAGTGT ACATCCTGAG CAAAGAAGAA GGTGGTCGTC ATACTCCATT 650 

CTTCAAAGGC TACCGTCCAC AGTTCTACTT CCGTACAACT GACGTGACCG 7 00 

60 GTACCATCGA ACTGCCAGAA GGCGTTGAAA TGGTAATGCC AGGTGACAAC 7 50 
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GTGAACATGG TTGTTACCCT GATCCACCCA ATCGCGA 787 



5 2 ) INFORMATION FOR SEQ ID NO: 168 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 



20 



GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 50 

TCATCATCGT ATTCATGAAC AAATGCGACA TGGTTGATGA TGAAGAGCTG 100 

YTGGAACTGG TAGAAATGGA AGTTCGCGAA CTGCTGTCCG CTTACGACTT 150 

CCCTGGCGAC GACCTGCCGG TAATCCGCGG TTCCGCGCTG AAAGCGCTGG 2 00 

25 AAGGCGAAGC TGAGTGGGAA GCGAAAATCA TCGAACTGGC CGAAGCCCTG 2 50 

GACAGCTACA TCCCAGAGCC AGAGCGTGCT ATCGACAAGC CGTTCCTGCT 3 00 

GCCAATCGAA GACGTATTCT CCATCTCCGG TCGTGGTACC GTTGTTACCG 3 50 

GTCGTGTTGA GCGCGGCATC ATCAAAGTTG GCGAAGAAGT TGAAATCGTT 400 

GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTGGCGTTG AAATGTTCCG 450 

3 0 CAAACTGCTG GAC GAAGGC C GTGC TGGTG A GAACGTAGGT GTTCTGCTGC 500 

-GTGGTATCAA ACGTGAAGAA ATCGAACGTG GTCAGGTACT GGCTAAGCCA 550 

GGCTCCATCA AGCCGCACAC CCAGTTCGAA TCTGAAGTGT ACATCCTGAG 600 

CAAAGATGAA GGTGGTCGTC ACACKCCATT CTTCAAAGGC TACCGTCCAC 650 

AGTTCTACTT CCGTACCACT GACGTGACCG GTACCATCGA ACTGCCAGAA 7 00 

3 5 GGCGTAGAGA TGGTAATGCC AGGCGACAAC GTGAACATGG TTGTA 745 



2 ) INFORMATION FOR SEQ ID NO: 169 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Serratia odor if era 

(B) STRAIN: ATCC 33077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 

55 GGCGCAATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 5 0 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCTTTC ATCATCGTGT 100 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAGCT GCTGTCTGCT TACGATTTCC CTGGCGACGA 200 

CCTGCCAGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 25 0 

60 AGTGGGAAGC TAAGATTGTA GAACTGGCTG AAGCGCTGGA TTCTTACATC 300 
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CCAGAACCAG AGCGTGCTAT CGACAAGCCG 
CGTATTCTCC ATCTCCGGTC GTGGTACCGT 
GCGGTATCAT CAAAGTTGGC GAAGAAGTTG 
ACCGTTAAGT CTACCTGTAC CGGTGTAGAA 
5 CGAAGGC CGT GCTGGTGAGA ACGTTGGTGT 
GTGAAGACAT CGAACGTGGT CAGGTTCTGG 
CCGCACACCA AATTCGACTC AGAAGTGTAC 
TGGTCGTCAC ACGCCATTCT TCAAAGGCTA 
GTACTACTGA CGTGACCGGT ACCATCGAAC 
10 GTAATGCCAG GCGATAACGT GAACATGGTT 
CGCAATGGAC GACGGTCTGC GTTTCGCAA 



TTCCTGCTGC CAATCGAAGA 3 50 

TGTTACCGGT CGTGTTGAGC 40 0 

AAATCGTTGG TATCAAAGAC 450 

ATGTTCCGCA AACTGCTGGA 500 

TCTGCTGCGT GGTATCAAGC 55 0 

CTAAACCAGG TTCTATCAAG 600 

ATCCTGAGCA AAGAAGAAGG 65 0 

CCGTCCACAG TTCTACTTCC 7 00 

TGCCAGAAGG CGTAGAGATG 7 50 

GTTACCCTGA TTCACCCAAT 800 

829 



15 2 ) INFORMATION FOR SEQ ID NO: 170 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS : Double 

(D) xGFGLGGY ; Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 0 



CGGCGCAATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTT CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTACGACTTC CCTGGTGATG 2 00 

35 AC CTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAACCA 2 50 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCT GGTTTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCTA TCGACAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

AC GTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTTGAG 4 00 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

40 CACCGTTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTGGGTG TTCTGCTGCG TGGTATCAAG 550 

CGCGAAGATA TCGAACGTGG TCAGGTCCTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACC AAGTTTG AC T CAGAAGTGTA CATCCTGAGC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 7 00 

45 CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGAAG GCGTAGAGAT 7 50 

GGTAATGCCA GGTGACAACG TGAACATGGT TGTAACCCTG ATCCACCCAA 8 00 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA 83 0 



50 



60 



2 ) INFORMATION FOR SEQ ID NO: 171 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 829 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 

GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTACCTTAC ATCATCGTAT 100 

TCATGAACAA ATGCGACATG GTAGATGATG AAGAGCTGCT GGAACTGGTA 15 0 

10 GAGATGGAAG TTCGCGAACT GCTGTCTGCT TACGACTTCC CAGGCGACGA 200 

CCTGCCGGTA ATCCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAAATCGTT GAGCTGGCAG AAGCGCTGGA CAGCTACATC 3 00 

CCAGAGCCAG AGCGTGCTGT AGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

C GTATTC TCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 400 

15 GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTAGG TATCAAAGAC 450 

AC CGTT AAGT CTACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CGAAGCCAGG TTCAATCAAG 600 

CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

2 0 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 7 50 

GTAATGCCAG GCGACAACGT GAACATGAAA GTTACTCTGA TTCACCCAAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCAA 82 9 



2) INFORMATION FOR SEQ ID NO: 172 

( i ) SEQUENCE CHARACTERISTICS : 
3 0 (A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydli 

(B) STRAIN: ATCC 92 07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 



CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

4 5 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTC TGTCTC A GTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 2 50 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

50 ACGTATTCTC CATC TCC GGT CGTGGTACCG TTGTT AC CGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

5 5 GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTC 82 6 
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2 ) INFORMATION FOR SEQ ID NO: 173 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 
15 (B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 5 0 

20 CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 200 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 250 

GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCYTAYATT CCGGAACCAG 300 

25 AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 350 

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 400 

CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACYCAGAAGT 450 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

30 CGAACGTGGT CAGGTACTGG CGAAGCCRGG CACCATCAAG CCGCACACCA 600 

"AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 7 00 

CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT C G CGATGG AC 800 

35 GACGGTCTGC GTTTCGCA 818 
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2) INFORMATION FOR SEQ ID NO: 17 4 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Shigella flexnerl 

(B) STRAIN: ATCC 12022 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 

55 CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC G AC AC T C CG A 2 00 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AG AGTGGGAA 250 

60 GCGAAAATCC TGGAACTGGC TGGCTTCCTG GATTCTTACA TTCCGGAACC 300 

308 
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AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTTG 
GTCTACCTGT 
GTGCTGGTGA 
ATC GAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGACG 



ATTGACAAGC 
TCGTGGTACC 
GTGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GTACCATCGA 
ATCAAAATGG 



CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



GCCGATCGAA 
GTCGTGTAGA 
GGTATCAAAG 
CAAACTGCTG 
GTGGTATCAA 
GGC AC CATC A 
CAAAGATGAA 
AGTTCTACTT 
GGCGTAGAGA 
GATCCACCCG 



GACGTATTCT 3 50 

ACGCGGTATC 400 

AG AC TC AG AA 450 

GACGAAGGCC 500 

ACGTGAAGAA 550 

AGCCGCACAC 600 

GGCGGMCGTC 650 

CCGTACTACT 7 00 

TGGTAATGCC 750 

ATCGCGATGG 800 
806 



15 2 ) INFORMATION FOR SEQ ID NO: 175 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 



• CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG GCGCAGACTC 5 0 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 10 0 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 15 0 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

35 ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 25 0 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 45 0 

40 G AC TC AGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 65 0 

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

45 CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 7 50 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TC GCGATGG A CGACGGTCTG CGTTTCGCAA TC 832 



50 

2 ) INFORMATION FOR SEQ ID NO: 176 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 716 bases 
55 (B) TYPE: Nucleic acid 

{ C ) STRANDEDNES S : Doubl e 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

TCTGCTGCTG ACGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC 5 0 

ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC AAAGTTGACA 100 

TGGTTGAC G A TGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTC GTG AC 15 0 

10 TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG TAATCGCTGG 2 00 

TTCAGCATTA AAAGC TTT AG AAGGCGATGC TCAATACGAA GAAAAAATCT 2 50 

TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC AGAACGTGAT 3 00 

TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG 3 50 

TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG 400 

15 GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA AACAACTGTT 450 

ACAGGTGTTG AAATGTTCCG TAAATTATTA G ACT ACGC TG AAGCTGGTGA 500 

CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG 550 

GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC TGAATTCAAA 600 

GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC ACACTCCATT 65 0 

20 CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT GACGTAACTG 7 00 

GTGTTGTTCA CTTACC 716 



2 5 2 ) INFORMATION FOR SEQ ID NO: 177 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 719 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 29247 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 



TTCTTTTATC ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC 5 0 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 15 0 

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 2 00 

45 GAAAAAATCT TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC 25 0 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 300 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA 400 

AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

50 AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 500 

GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC ' 55 0 

TGAATTCAAA GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC 60 0 

ACACTCCATT CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT 650 

GACGTAACTG GTGTTGTTCA CTTACCAGAA GGTMC TG AAA TGGTAATGCC 7 00 

55 TGGTGATAAC GTTGAAATG 719 



60 



2 ) INFORMATION FOR SEQ ID NO: 178 

310 



CA 02307010 2000-05-19 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 33591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 8 

15 GTG AAC AC AT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTG AC GAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGC G A ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 2 00 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 2 50 

20 TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 00 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 3 50 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 4 00 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 4 50 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

2 5 CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTC AATT AC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTC 62 5 
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2 ) INFORMATION -FOR SEQ ID NO: 179 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 7 04 bases 
3 5 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 

GTTGGTGTAC CAGCATTAGT AGTATTCTTA AACAAAGTTG ACATGGTTGA 50 

CGATGAAGAA TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAA 100 

GCGAATATGA CTTCCCAGGT GACGATGTAC CTGTAATCGC TGGTTCAGCA 15 0 

50 TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA TCTTAGAATT 2 00 

AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT GATTCTGACA 250 

AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC TGGTCGTGGT 300 

ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG TTGGTGAAGA 3 50 

AGTTGAAATC ATCGGTTTAC ATGACACATC TAAAACAACT GTTACAGGTG 4 00 

55 TTGAAATGTT C CGT AAATTA TTAGACTACG CTGAAG CTGG TGACAACATT 4 50 

GGTGCATTAT TACGTGGTGT TGCTCGTGAA GACGTACAAC GTGGTCAAGT 500 

ATTAGCTGCT CCTGGTTCAA TTACACCACA T AC TGAATTC AAAGCAGAAG 550 

TATACGTATT ATCAAAAGAC GAAGGTGGAC GTCACACTCC ATTCTTCTCA 600 

AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA CTGGTGTTGT 650 

60 TCACTTACCA G AAGGT AC TG AAATGGTAAT GCCTGGTGAT AACGTTGAAA 700 
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TGAC 



704 



2 ) INFORMATION FOR SEQ ID NO : 180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 6538 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

GTGAAGAG-AT TCTTTTATCA CGTftACGTTG GTGxACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT T ATTAAGC G A ATATGACTTC CCAGGTGACG 15 0 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGC TTT AG A AGGCGATGCT 200 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 350 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 400 

CACATCTAAA AC AAC TGTT A CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG -TACAACGTGG TCAAGTATTA .G.CTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC 650 

CGTACTACTG AC GTAACTGG TGTTGTTCAC TTACCAGAAG GTACTGAAAT 7 00 

GGTAATGCCT GGTGATAACG TTGAAATGAC 730 



2 ) INFORMATION FOR SEQ ID NO: 181 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 4 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricular is 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

CGGTGCGATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTAC C AGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTG AC C A AGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGC G A AT ACG AC T AC CCAGGTGACG 2 00 

ATGTACCTGT AATCTCTGGT TCTGCGTTGA AAGCATTAGA AGGC G AC AAA 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAACAAGTTG ACGATTACAT 3 00 
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TCCAACTCCA G AACGTG AC T 
ACGTATTCTC AATCACTGGT 
CGTGGTCAAA TCAAAGTCGG 
CGGTTCACAA AAAACAACAG 
5 TAGACTACGC TGAAGCTGGT 
TCACGTGAAG AAGTACAACG 
TACACCACAC ACTAAATTCA 
AAGGTGGACG TCACACTCCA 
TTCCGTACTA CTGACGTAAC 
10 AATGGTAATG CCTGGCGATA 
CAATCGCTAT CGAAGACGGT 



CTGATAAACC ATTCATGATG 
CGTGGTACTG TTGCAACAGG 
TGAAGAAGTT GAAATCATCG 
TTACTGGTGT AGAAATGTTC 
GACAACATCG GTGCTTTATT 
TGGTCAAGTT TTAGCTGCTC 
CTGCAGAAGT TTACGTATTA 
TTCTTCTCTA ACTACCGTCC 
AGGTGTTGTT AC TTTACC AG 
ACGTTAAAAT GGAAGTTGAA 
ACTCGTTTCT CAAT 



CCAGTTGAAG 


350 


CCGTGTTGAA 


400 


GTATGAAAGA 


450 


CGTAAATTAT 


500 


ACGTGGTATT 


550 


CTGGTTCAAT 


600 


TCTAAAGATG 


650 


ACAATTCTAT 


700 


AAGGTACAGA 


750 


TTAATTTCTC 


800 




834 



15 2 ) INFORMATION FOR SEQ ID NO: 182 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTG AAC AC AT CTTATTATCA CGTAAC GTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT T AG AATT AG T 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGATG 200 

3 5 ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 2 50 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATCCACGA 450 

40 AACTTCTAAA ACAACTGTTA C TGGTGT AG A AATGTTCCGT AAATTATTAG 500 

AC T AC GCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATCAC 600 

ACCACACACT AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 7 00 

45 CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 7 50 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCTA 800 

TCGCTATTGA AGACGGTACT CGTTTCTCAA TCGGA 835 



50 

2 ) INFORMATION FOR SEQ ID NO : 183 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 804 bases 
55 (B) TYPE: Nucleic acid 

<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 



313 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Macrococcus caseolyticus 

(B) STRAIN: ATCC 13548 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 

GTATCTTAGT AGTATCTGCT GCTGACGGTC CAATGCCACA AACTCGTGAA 50 

CACATCCTTT TATCACGTAA CGTTGGTGTA CCAGCATTAG TAGTATTCTT 100 

GAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTTGAAA 150 

10 TGGAAGTTCG TGACTTATTA TCTGAATATG ACTTCCCTGG TGACGATGTA 2 00 

CCTGTAATCG CTGGATCTGC TTTAAAAGCA TTAGAAGGCG TTGAAGAATA 250 

CGAAGACAAA ATCATGGAAT TAATGGACGC AGTTGATGAG TACATCCCAA 3 00 

CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 3 50 

TTCTCAATCA CTGGTCGTGG TACAGTTGCA ACTGGACGTG TTGAGCGTGG 4 00 

15 ACAAGTTAAA GTTGGTGAAG AAGTTGAAAT CATTGGTTTA ACTGAAGAAC 450 

CAGCAAAAAC TACAGTTACA GGTG TAG AAA TGTTCCGTAA ATTATTAGAT 500 

TACGCTGAAG CTGGAGATAA CATCGGTGCT TTATTACGTG GTGTTTCTC G 550 

TGAAGACGTA CAACGTGGAC AAGTATTAGC TAAACCAGGT TCAATTACTC 600 

CACATACTAA ATTCAAAGCT G AAGTTT AC G TATTATCTAA AGAAGAAGGT 650 

2 0 GGACGTCATA CTCCATTCTT C ACT AAC TAC CGCCCTCAGT TCTACTTCCG 7 00 

TACAACTGAC GTAACTGGTG TAGTTAACTT ACCAGAAGGT AC TG AAATGG 7 50 

TAATGCCTGG AGATAACATC GAAATGAACG TTGAATTAAT TTCTCCAATC 800 

GCGA 804 



25 



40 



2 ) INFORMATION FOR SEQ ID NO: 184 



( i ) SEQUENCE CHARACTERISTICS : 
30 (A) ^LENGTH: 832 bases 

(B) TYPE": Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnli 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 5 0 

GTGAAC ATAT CCTTTTATCA CGTAACGTTG GTGTTCCAGC ATTAGTTGTA 100 

45 TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA AT ATG AC TTC CCAGGTGACG 200 

ATGTACCTGT AATCTCTGGT TCAGCATTAA AAGCTCTTGA AGGCGACGCT 250 

GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATG AC TTC AT 3 00 

TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

50 ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG GCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGCAAGA 450 

AGATTCAAGC AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATTG GTGCGTTATT ACGTGGTGTT 550 

GCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600 

55 TACACCACAC ACAAACTTTA AAGCGGAAGT TTACGTTTTA TCAAAAGATG 650 

AAGGTGGCCG TCATACGCCA TTCTTCAGTA ACTATCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC AGGTGTTGTT AC TTT AC C AG AAGGTACTGA 75 0 

AATGGTTATG CCTGGCGACA ACGTAGAAAT GGAAGTTGAA CTAATTTCTC 800 

CAATCGCTAT CGAAGACGGT AC ACGTTTC T CT 83 2 



60 
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2 ) INFORMATION FOR SEQ ID NO: 185 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 699 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

2 0 ATCTGCTGCT GACGGTCCAA TGCCACAAAC TCGTGAACAC ATCTTATTAT 50 

CACGTAACGT TGGTGTACCA GCATTAGTTG TATTCTTAAA CAAAGTTGAC 100 

ATGGTAGACG ACGAAGAATT ATTAGAATTA GTTGAAATGG AAGTTCGTGA 15 0 

CTTATTAAGC GAATATGACT TCCCAGGTGA CGATGTACCT GTAATCGCTG 2 00 

GTTCTGCATT AAAAGCATTA GAAGGCGATG CTGAATACGA ACAAAAAATC 2 50 

25 TTAGACTTAA TGCAAGCAGT TGATGATTAC ATTCCAACTC CAGAACGTGA 3 00 

TTCTGACAAA CCATTCATGA TGCCAGTTGA GGACGTATTC TCAATCACTG 35 0 

GTCGTGGTAC TGTTGCTACA GGCCGTGTTG AACGTGGTCA AATC AAAGTT 400 

GGTGAAGAAG TTGAAATCAT CGGTATGCAC GAAACTTCTA AAACAACTGT 450 

TACTGGTGTA GAAATGTTCC GTAAATTATT AG ACT AC G CT GAAGCTGGTG 500 

3 0 ACAACATCGG TGC TTTATT A CGTGGTGTTG CACGTGAAGA CGTACAACGT 550 

GGTCAAGTAT TAGCTGCTCC TGGTTC T ATT ACACCACACA CAAAATTCAA 600 
AGCTGAAGTA TACGTATTAT CTAAAGATGA AGGTGGACGT CACACTCCAT 650 
TCTTCACTAA CTATCGCCCA CAATTCTATT TCCGTACTAC TGACGTAAC 699 



35 



50 



2 ) INFORMATION FOR SEQ ID NO: 186 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 



CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 

55 TTCTTAAATA AAGTTGACAT GGTTGACGAT GAAGAATTAT T AG AATT AG T 150 

TGAAATGGAA GTACGTGACT TATTATCTGA AT AC G ACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGC TTTAG A AGGCGATGCT 250 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATG AC TAC AT 300 

TCCAACTCCA GAACGTGATT CTQACAAACC ATTCATGATG CCAGTTGAGG 3 50 

60 ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 
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CGTGGGCAAA TCAAAGTTGG 
CACTTCTAAA ACAACTGTTA 
ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
5 ACCTCACACA AAATTTAAAG 
GTGGACGTCA CACTCCATTC 
CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGCGACAACG 
TCGCGATTGA AGACGGTACT 



TGAAGAAGTT GAAATCATTG 
CTGGTGTAGA AATGTTCCGT 
AACATCGGTG CATTATTACG 
TCAAGTATTA GCTGCTCCAG 
C AG AC G TATA CGTTTTATCT 
TTCACAAACT ATCGTCCACA 
TGTTGTTAAC TTACCAGAAG 
TTGAAATGAC AGTAGAATTA 
CGTTTCTCA 



GTATCCATGA 4 50 

AAATTATTAG 5 00 

TGGTGTTGCT 550 

GTTCAATC AC 60 0 

AAAGAC G AAG 650 

ATTCTATTTC 7 00 

GTACTGAAAT 750 

ATCGCTCCTA 800 
829 



2) INFORMATION FOR SEQ ID NO: 187 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 
25 (B) STRAIN: CSG 123 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

CACAAACTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT 50 

3 0 TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT 100 

AGAATTAGTA J3AAATGGAAG TTCGTGACTT ATTATCTGAA TATGACTTCC 150 

CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 2 00 

GGCGACGAAA AATACGAAGA AAAAATC TT A GAATTAATGC AAGC AG TTG A 250 

TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTCATGATGC 300 

35 CAGTTGAGGA CGTATTCTCA ATCACTGGTC GTGGTACTGT TGCTACAGGC 350 

CGTGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGTTG AAATCATCGG 400 

TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA ATGTTCCGTA 450 

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC TTTATTACGT 500 

GGTGTTGCTC GTGAAGACGT AC AAC GTGGT CAAGTATTAG CTGCTCCTGG 550 

40 TTCAATTACA CCACATACAA AATTCAAAGC GGAAGTTTAC GTTTTATCTA 600 

AAGACGAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGCCCACAA 650 

TTCTATTTCC GTAC TACTGA CGTAACTGGC GTTGTTCAAT TACCAGAAGG 7 00 

TACTG 705 



45 



2 ) INFORMATION FOR SEQ ID NO: 188 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 678 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 23 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 

TTTTATCACG TAACGTTGGT GTACCAGCAT TAGTAGTATT CTTAAACAAA 50 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTTG AAATGGAAGT 100 

5 ACGTGACTTA TTATCTGAAT ACGACTTCCC AGGTGACGAC GTACCTGTAA 150 

TCGCTGGTTC AGCTTTAAAA GCTTTAGAAG GCGATGCTCA ATACGAAGAA 2 00 

AAAATC TTAG AATTAATGCA AGCAGTTGAT GATTACATTC CAACTCCAGA 2 50 

ACGTGACTCT GATAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 3 00 

TCACTGGTCG TGGTACTGTT GCTACAGGTC GTGTTGAACG TGGTCAAATC 350 

10 AAAGTTGGTG AAGAAGTTGA AATTATTGGT ATCAAAGAAA CTTCTAAAAC 400 

AACTGTTACT GGTGTAGAAA TGTTC CGTAA ATTATTAGAC TACGCTGAAG 450 

CTGGTGACAA CATCGGTGCT TTATTACGTG GTGTTGCTCG TGAAGATGTA 500 

CAACGTGGTC AAGTATTAGC TGCTCCAGGT TCAATTACAC CTCACACAAA 550 

ATTCAAAGCA GACGTATACG TTTTATCAAA AGATGAAGGT GGACGTCATA 600 

15 CTCCATTCTT CACTAACTAT CGTCCACAAT TCTATTTCCG TACTACTGAC 650 

GTAACTGGTG TTGTTAACTT ACCAGAAG 67 8 



20 2 ) INFORMATION FOR SEQ ID NO: 189 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: • Staphylococcus haemolyticus 

(B) STRAIN: CSG 33 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 189 



ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 5 0 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAG AAGGC GATGCTCAAT AC GAAG AAAA AATCTTAGAA TTAATGCAAG 2 00 

40 CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 2 50 

ATGATGCCAG TTGAGGACGT ATTCTCAATC AC TGGTCGTG GTACTGTTGC 3 00 

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 400 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

4 5 ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGG 668 

50 

2) INFORMATION FOR SEQ ID NO: 190 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
5 (B) STRAIN: CSG 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TTGAAATGGA 50 

10 AG T AC GTG AC TTATTATCTG AATACGACTT CCCAGGTGAC GATGTACCTG 100 

TAATCGCTGG TTCAGCATTA AAAGC TTTAG AAGGCGATGC TCAATACGAA 150 

GAAAAAATCT TAGAATTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 2 00 

AG AAC GTG AT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 250 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 3 00 

15 ATCAAAGTTG GTGAAGAAGT TGAAATCATT GGTATCCATG ACACTTCTAA 3 50 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 400 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 450 

GTACAACGTG GTCAAGTATT AGCTGCTCCA GGTTCAATCA CACCTCACAC 500 

' AAAATTTAAA G C AG AC GTAT AC GTTTT ATC TAAAGACGAA GGTGGACGTC 550 

2G ACAG^rvATT CTTCACAAAC TATCGTCCAC AATTCTATTT CCG 593 



25 



30 



35 



2 ) INFORMATION FOR SEQ ID NO: 191 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: t . . 

(A) ORGANISM: Staphylococcus homims subsp. hominis 

(B) STRAIN: ATCC 27844 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 



40 CGGCGCTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ACGTAC CTGT 

4 5 CAATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
AACTTC T AAA 

50 ACT ACGC TG A 
CGTGAAGATG 
ACCTCACACA 
GTGGACGTCA 
CGTACTACTG 

55 GGTAATGCCT 
TCGCGATTGA 



TTAGTAGTAT 
TCTTTTATCA 
AAGTTGACAT 
GTACGTGACT 
AATCGCTGGT 
AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTTGG 
ACAACTGTTA 
AGCTGGTGAC 
TACAACGTGG 
AAATTCAAAG 
TACTCCATTC 
ACGTAACTGG 
GGTGACAACG 
AG ACGGTAC T 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTATCTGA 
TCAGCTTTAA 
AGAATTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
CTGGTGTAGA 
AACATCGGTG 
TCAAGTATTA 
C AG AC GTAT A 
TTCTCTAACT 
TGTTGTTAAC 
TTGAAATGAC 
CGTTTCTC 



TGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATTATTG 
AATGTTCCGT 
CTTTATTACG 
GCTGCTCCAG 
CGTTTTATCA 
ATCGTCCACA 
TTACCAGAAG 
AGTAGAATTA 



CCACAAACTC 50 

ATTAGTAGTA 100 

TAGAATTAGT 150 

CCAGGTGACG 200 

AGGCGATGCT 250 

ATG ATT ATAT 3 00 

CCAGTTGAGG 350 

CCGTGTTGAA 400 

GTATCAAAGA 450 

AAATTATTAG 500 

TGGTGTTGCT 550 

GTTCAATTAC 600 

AAAGATGAAG 65 0 

ATTCTATTTC 7 00 

GT AC TGAAAT 750 

ATCGCTCCTA 800 
828 



60 2 ) INFORMATION FOR SEQ ID NO: 192 

318 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 620 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 35982 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



ATGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC ACGTAACGTT 50 

GGTGTACCAG CTTTAGTTGT ATTCTTAAAC AAAGTTGATA TGGTAGACGA 100 

CGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC TTATTATCTG 15 0 

AATATGACTT CCCAGGTGAC GACGTACCTG TAATCGCTGG TTCAGCATTA 200 

2 0 AAAGC TTT AG AAGGCGACGA AAAATACGAA GAAAAAATCT TAGAATTAAT 2 50 

GCAAGCAGTT GATGACTACA TTCCAACTCC AGAACGTGAT TCTGACAAAC 3 00 

CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG TCGTGGTACT 350 

GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG GTGAAGAAGT 400 

TGAAATCATC GGTTTACATG ACACTTCTAA AACAACTGTT ACTGGTGTAG 450 

25 AAATGTTCCG TAAGTTATTA GACTACGCTG AAGCTGGTGA CAACATCGGT 500 

GCTTTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG GTCAAGTATT 550 

AGCTGCTCCT GGTTCAATTA CACCACATAC AAAATTCAAA GCGGAAGTTT 600 

ACGTTTTATC TAAAGACGAA 620 



2 ) INFORMATION FOR SEQ ID NO: 193 



( i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

ATTATTAGAA TTAGTTGAAA TGGAAGTACG TG AC TT ATT A TCTGAATACG 100 

50 ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 15 0 

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 200 

AGTTGATGAT TATATTCCAA CTCCAGAACG TGACTCTGAT AAACCATTCA 2 50 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 350 

55 TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATT AG AC T AC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 450 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 600 

60 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 
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AGAAGGTACT GAAATGGTAA TGCCTGGTGA CAACGTTGAA AT 692 



5 2 ) INFORMATION FOR SEQ ID NO : 194 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 684 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 36 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 



CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 5-0 

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100 

GAAGTACGTG AC TTATT ATC TGAATACGAC TTCCCAGGTG ACGACGTACC 150 

TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGC GAT GCTCAATACG 2 00 

25 AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 250 

CCAGAACGTG ACTCTGATAA ACCATTCATG ATGCCAGTTG AGGACGTATT 300 

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGC CGTGTT GAACGTGGTC 3 50 

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 400 

AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 450 

30 TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT GCTCGTGAAG 500 

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC ,550 

ACAAAATTCA AAGCAGACGT" ATACGTTTTA TCAAAAGATG AAGGTGGACG 600 

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 65 0 

CTGACGTAAC TGGTGTTGTT AACTTACCAG AAGG 684 

35 

2 ) INFORMATION FOR SEQ ID NO: 195 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 
50 (B) STRAIN: CSG 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 195 

. ACCAGCATTA GTAGTATTCT TAAACAAAGT TGACATGGTT GACGATGAAG 5 0 

55 AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGACGT ACCTGTAATC GCTGGTTCAG CTTTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

CAGTTGATGA TTACATTCCA ACTCCAGAAC GTGACTCTGA TAAACCATTC 2 50 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 3 00 

60 TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 
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TTATTGGTAT CAAAGAAACT TCTAAAACAA 
TTCCGTAAAT TATTAGACTA CGCTGAAGCT 
ATTACGTGGT GTTGCTCGTG AAGATGTACA 
CTCCAGGTTC AATTACACCT CACACAAAAT 
5 TTATCAAAAG ATGAAGGTGG ACGTCATACT 
TCCACAATTC TATTTC CGT A CTACTGACGT 
CAGAAGGTAC TGAAATGGTA ATGCCTGGCG 



CTGTTACTGG TGTAGAAATG 400 

GGTGACAACA TCGGTGCTTT 450 

ACGTGGTCAA GTATTAGCTG 500 

TCAAAGCAGA CGTATACGTT 550 

CCATTCTTCA CTAACTATCG 600 

AACTGGTGTT GTTAAC TT AC 650 

ACAAC 685 



10 



20 



2 ) INFORMATION FOR SEQ ID NO: 196 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 611 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



{ i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominls 

(B) STRAIN: CSG 62 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACGA CTTCCCAGGT G AC G AC GTAC CTGTAATCGC 5 0 

TGGTTCAGCT TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA 100 

TCTTAGAATT AATGCAAGCA GTTGATGATT ACATTCCAAC TCCAGAACGT 150 

3 0 G AC TCTGAT A AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC 2 00 

TGGTCGTGGT AC TGTTGC T A CAGGCCGTGT TGAACGTGGT CAAATCAAAG 250 

TTGGTGAAGA AGTTGAAATT ATTGGTATCA AAGATACTTC TAAAACAACT 300 

GTTACTGGTG TAGAAATGTT C CGT AAATT A TTAGACTACG CTGAAGCTGG 350 

TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGTGAA GATGTACAAC 400 

35 GTGGTCAAGT ATTAGCTGCT CCAGGTTCAA TCACACCTCA CACAAAATTC 450 

AAAGC AG ACG TATATGTTTT ATCAAAAGAT GAAGGTGGAC GTCATACTCC 500 

ATTCTTCACT AAC TATCGTC CACAATTCTA TTTC CGTACT ACTGACGTAA 55 0 

CTGGTGTTGT TAACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGCGAC 600 

AACGTTGAAA T 611 

40 



2 ) INFORMATION FOR SEQ ID NO: 197 

45 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 
55 (B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 50 
60 GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 100 
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TTCTTAAACA AAGTTGACAT GGTTG AC GAT GAAGAATTAT TAGAATTAGT 150 

ISS^aa gttcgtgatt tattaactga ATATGACTTC CCAGGTGACG 200 

ATGTGCCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 2 50 

AAATACGAAG CTAAAATCTT AGAATTAATG GATGCAGTTG ATAACTACAT 3 00 

5 ^ScTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

aS^ttctc AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATTATTG GTATCCACGA 450 

T AC TACT AAA AC AAC TGTT A CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGC TG A AGCTGGTGAC AACATCGGTG CGTTATTACG TGGTGTTGCT 5 50 

io ?§?Saaga?g TACAACGTGG ACAAGTATTA GCTGCTCCAG gttcaattac 600 

ACCTCACACT AAATTTAAAG CTGACGTATA TGTTTTATCT AAAGATGAAG 65 0 

GTGG AC GTC A T AC AC C ATTC TTCTCAAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACAG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACAGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

TCGCTATCGA AGACGGAACT CGTTTCTC 828 



15 



20 



2 ) INFORMATION FOR SEQ ID NO: 198 



' i > SEQUENCE CHARACTERISTICS : 
7aT LENGTH: 690 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Staphylococcus saprophytxcus 

(B) STRAIN: ATCC 35552 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

3 5 AGTAGTATCT GCTGCTGATG GCCCAATGCC ACAAACTCGT GAACACATTC 50 

TTTTATCACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT CTTAAACAAA 100 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTG AC TT A TTAAGCGAAT ATGACTTCCC AGGTG AC GAT GTACCTGTAA 2 00 

TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA CTATGAGCAA 2 50 

40 AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC CAACACCAGA 3 00 

ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 350 

TCACTGGTCG TGGTAC TGTT GCTACAGGCC GTGTTGAACG TGGTCAAATC 400 

AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG AATCAAGCAA 450 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

45 AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC ACGTGATGAC 550 

GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA CACCACATAC 600 

AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA GGTGGTCGTC 650 

ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 690 

50 

2) INFORMATION FOR SEQ ID NO : 199 

( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 723 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG 83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

GCATTAGTTG TATTCTTAAA CAAAGTTGAC ATGGTTGACG ATGAAGAATT 5 0 

ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTTATTAAGC GAATATGACT 100 

10 TCCCAGGTGA CGATGTACCT GTAATCTCTG GTTCTGCATT AAAAGC TTT A 150 

GAAGGCGACG C TG ACT ATG A GCAAAAAATC TT AG AC TT AA TGCAAGCTGT 2 00 

TGATGACTTC ATTCCAACAC CAGAACGTGA TTC TG AC AAA CCATTCATGA 2 50 

TGCCAGTTGA GGACGTATTC TCAATCACTG GTCGTGGTAC TGTTGCTACA 3 00 

GGCCGTGTTG AAC GTGGTC A AATCAAAGTC GGTGAAGAAA TCGAAATCAT 3 50 

15 CGGTATGCAA GAAGAATCAA GCAAAACAAC TGTTAC TGGT G TAG AAATGT 4 00 

TCCGTAAATT ATT AG AC T AC GCTGAAGCTG GTGACAACAT TGGTGCATTA 450 

TTACGTGGTG TTTCACGTGA TGACGTACAA CGTGGTCAAG TTTTAGCTGC 500 

TCCTGGTACT ATTACACCAC ATACAAAATT CAAAGCGGAT GTTT AC GTTT 550 

TATCTAAAGA TGAAGGTGGT CGTCATACAC CATTCTTCAC T AAC T AC C GC 600 

2 0 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 

AGAAGGTACT GAAATGGTTA TGCCTGGCGA TAACGTTGAA ATGGATGTTG 700 

AATTAATTTC TCCAATCGCT ATT 72 3 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 2 00 



( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 697 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa 18 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 00 

CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT GACATGGTTG 50 

AC G ATGAAG A ATTATTAGAA TTAGTAGAAA TGGAAGTTCG TGACTTATTA 10 0 

AGCGAATATG ACTTCCCAGG TGACGATGTA CCTGTAATCT CTGGTTCTGC 15 0 

4 5 ATT AAAAGC T TTAGAAGGCG ACGCTGACTA TGAGCAAAAA ATCTTAGACT 2 00 

TAATGCAAGC TGTTGATGAC TTCATTCCAA CAC CAGAACG TGATTCTGAC 250 

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG 300 

TACTGTTGCT ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTCGGTGAAG 350 

AAATCGAAAT CATCGGTATG CAAGAAGAAT CAAGCAAAAC AAC TGTTACT 400 

.50 GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG CTGGTGACAA 450 

CATTGGTGCA TTATTACGTG GTGTTTCACG TGATGACGTA CAACGTGGTC 500 

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CAC ATAC AAA ATTCAAAGCG 550 

GATGTTTACG TTTTATCTAA AGATGAAGGT GGTCGTCATA CACCATTCTT 600 

CACTAACTAC CGCCCACAAT TCTATTTCCG TACTACTGAC GTAACTGGTG 650 

55 TTGTTAACTT ACCAGAAGGT ACTGAAATGG TTATGCCTGG CGATAAC 697 



60 



2 ) INFORMATION FOR SEQ ID NO : 201 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus sciuri subsp. sciuri 

(B) STRAIN: ATCC 29060 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

15 CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCTCAAACTC 50 

GTGAGCACAT TCTTTTATCA CGTAACGTAG GTGTTCCTGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTG AC GAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGCGACG 2 00 

ACGTTCCTGT AATTGCTGGT TCAGCATTAA AAGCATTAGA AGGCGACGAA 2 50 

20 GCTTACGAAG ACAAAATCAT GGAATTAATG GATGCTGTTG ATACATTCAT 3 00 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCACTGTTGG TGAAGAAGTT GAAATCATCG GTTTAAC TGA 450 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

25 TAGACTTCGC TGAAGCTGGA GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCTCGTGAAG ACGTTAACCG TGGTCAAGTA TTAGCTAAAC CAGGTTCAAT 600 

CACACCTCAC ACTAAATTCA AAG CTGAAGT TTATGTATTA TCTAAAGACG 650 

AAGGTGGACG TCATACTCCA TTCTTCACAA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTGTAGTT AAC TTACC AG AAGGT AC TGA 750 

3 0 AATGGTTATG CCTGGCGACA ACGTTGAAAT GGACGTTGAA TTAATTTCAC 800 

CAATCGGTAT TGAAGACGGT ATCGTTTCTC AATCA 835 



3 5 2 ) INFORMATION FOR SEQ ID NO: 2 02 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 



CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 10 0 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 15 0 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 200 

55 ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

60 CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 
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ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
ACCACATACA AAATTCAAAG 
GTGGACGTCA CACTCCATTC 
5 CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGTGATAACG 
TCGCGATTGA AG ACGGT AC T 



AACATCGGTG CTTTATTACG 
TCAAGTATTA GCTGCTCCTG 
CGGAAGTTTA CGTTTTATCT 
TTCAGTAACT ACCGCCCACA 
CGTTGTTCAA TTACCAGAAG 
TTGAAATGAC AGTAGAATTA 
CGTTTCTCAA C 



TGGTGTTGCT 550 

GTTCAATTAC 600 

AAAGACGAAG 650 

ATTCTATTTC 7 00 

GTACTGAAAT 750 

ATCGCTCCTA 800 
831 



10 

2) INFORMATION FOR SEQ ID NO: 203 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 829 bases 
15 (B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warnerl 

(B) STRAIN: CSG 50 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 15 0 

3 0 AGAAATGGAA GTTCGTGACT TATTATCTGA AT ATG AC TTC CCAGGTGACG 2 00 

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA - -250 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 30 0 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 35 0 

ACGTATTCTC AATC AC TGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

35 CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 45 0 

CACTTCTAAA ACAACTGTTA C TGGTGT AG A AATGTTCCGT AAGTTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 600 

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 65 0 

40 GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 7 00 

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 7 50 

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 800 

TCGCGATTGA AGACGGTACT CGTTTCTCA 829 



45 



2) INFORMATION FOR SEQ ID NO: 204 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 157 07 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 

TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCCCAGACTC 50 

GCGAGCACGT GCTGCTCGCC CGTCAGGTTG GCGTTCCGAA GATCCTCGTC 100 

5 GCCCTGAACA AGTGCGACAT GGTCGACGAT GAAGAGCTCA TCGAGCTCGT 150 

CGAAGAAGAG GTCCGCGACC TCCTCGACGA GAACGGCTTC GACCGTGACT 2 00 

GC CCGGTC AT CCACACCTCC GCTTACGGTG CTCTGCACGA CGACGCTCCG 2 50 

GACCACGAGA AGTGGGTCCA GTCCGTTAAG GACCTCATGG ACGCTGTCGA 3 00 

CGACTACATC CCGACCCCGG TTCACGACCT GGACAAGCCG TTCCTGATGC 350 

10 CGATCGAGGA CGTCTTCACC ATCTCCGGCC GTGGTACCGT TGTCACCGGT 4 00 

CGTGTCGAGC GTGGCCAGCT GGCCGTCAAC ACCCCGGTCG AGATCGTTGG 450 

TATCCGTCCG ACCCAGCAGA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGACCATGGA CGCCTGCGAG GCTGGCGACA ACACCGGTCT GCTTCTGCGT 55 0 

GGTCTCGGCC GTGACGATGT CGAGCGTGGC CAGGTTGTGG CCAAGCCGGG 600 

15 CTCCGTCACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCATCGAGC TGCCGGAAGG 7 50 

CGTCGAGATG GTTCAGCCGG GCGACCACGC TACCTTCACC GTTGAGCTGA 800 

TTCAGCCCAT CGCTATGGAG GAAGGCCTGA CCTTCGCTG 839 

20 



2 ) INFORMATION FOR SEQ ID NO : 205 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 54 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

( i i ) MOLECULE— TYPE : Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 
35 (B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 05 

CGGCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

40 GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTC G AC G AC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTGCGC GAAC TGCTGAGCAA GTACGAGTTC CCGGGCGACG 2 00 

ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGCGCTGGA AGGCGACCAG 2 50 

AGCGACATCG GCGTGCCGGC CATCCTGAAG CTGGTCGACG CGCTGGACAG 3 00 

45 CTGGATTCCG GAGCCGGAGC GTGCGATCGA CAAGCCGTTC CTGATGCCGG 3 50 

TGGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT GACCGGTCGT 400 

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTCGGCAT 450 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTGGACCA GGGTCAGGCA GGCGACAACG CTGGCCTGCT GCTGCGCGGC 550 

50 ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGCAC 600 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTGTCGAAGG 650 

ACGAGGGCGG CCGCCACACC CCGTTCTTCA ACGGCTACCG TCCGCAGTTC 700 

TACTTCCGCA CCACCGACAT CACCGGCGCC GCTGCACTGC CGGAAGGCGT 75 0 

CGAA 754 

2 ) INFORMATION FOR SEQ ID NO : 2 06 
60 (i) SEQUENCE CHARACTERISTICS:- 
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(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominlmus 
10 (B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 206 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

15 GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAAAA CCTTATCGTT 100 

TTCATGAACA AAGTTG AC CT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATACGATTTC CCAGGTGATG 2 00 

ATCTTCCAGT TGTTCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATTCA 2 50 

GCACAAGAAG ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 3 00 

20 TCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAGG 3 50 

ATGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATC G AC 4 00 

CGTGGTACTG TTAAAGTTAA TGACGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

CGAAATCTCT AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCTGGA GATAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

2 5 CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 7 00 

TTCCGTACAA CTGACGTAAC TGGTTCAATC AAATTGC C AG AAGGTACTGA 750 

AATGGTAATG C C TGGTGATA ACGTAACTAT CGAAGTTGAG TTGATCCACC 800 

30 CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 835 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 207 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12403 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 



50 CTATCCTTGT 
CACATCCTTC 
GAACAAAGTT 
TGGAAATTCG 
CCAGTTATCC 

55 CGAAGACATC 
AACCAGAACG 
TTCTCAATCA 
TACTGTTCGT 
TCCAAAAAGC 

60 GAAGGTCTTG 



AGTTGCTTCA 
TTTCACGTCA 
GACC TTGTTG 
TGACCTTCTT 
AAGGTTCAGC 
ATCATGGAAT 
TG AT AC TG AC 
CTGGACGTGG 
GTCAACGACG 
AGTTGTTACT 
CAGGGGACAA 



ACTGATGGAC 
AGTTGGTGTT 
ATGATGAAGA 
TCAGAATACG 
TCTTAAAGCA 
TGATGAGCAC 
AAACC TTT AC 
TACAGTTGCT 
AAGTTG AAAT 
GGTGTTGAAA 
CGTTGGTGTT 



C AATGC C AC A 
AAACACCTTA 
ATTGCTTGAA 
ACTTCCCAGG 
CTTGAAGGCG 
TGTTGATGAG 
TTCTTCCAGT 
TCAGGACGTA 
CGTTGGTATT 
TGTTCCGTAA 
CTTCTTCGTG 



AAC TCGTG AG 50 

TCGTATTCAT 100 

TTGGTTGAAA 150 

TGATGAC CTT 200 

ACGAAAAATA 250 

TACATTCCAG 300 

TGAAGATGTA 3 50 

TCGACCGTGG 400 

AAAGAAGATA 450 

AC AAC TTGAC 500 

GTGTTCAACG 550 
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TGATGAAATC GAACGTGGTC AAGTTCTTGC 
CACACACTAA ATTTAAAGGT GAAGTTTACA 
GGACGTCATA CTCCATTCTT CAACAACTAC 
TACAACTGAC GTAACAGGTT CAATCGAACT 
TTATGCCTGG TGATAACGTT ACTATCGAAG 
GCCGTAGAAC AAGGTACTA 



TAAACCAGGT TCAATCAACC 600 
TCCTTTCTAA AGAAGAAGGT 65 0 

CGTCCACAAT TCTACTTCCG 7 00 

TCCAGCAGGA ACAGAAATGG 7 50 

TTGAATTGAT TCACCCAATC 800 

819 



10 2 ) INFORMATION FOR SEQ ID NO: 208 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi)QHIGJNAL SOURCE: 

(A) ORGANISM: Streptococcus agslactiae 

(B) STRAIN: ATCC 12973 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 



CTATCCTTGT AGTTGCTTCA AC TGATGG AC C AATGC C AC A AACTCGTGAG 5 0 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

30 C CAGTT ATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAAAAATA 250 
CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG - 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT CGAAGATGTA 3 50 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

35 TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA AC AAC TTG AC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 65 0 

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

40 TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 75 0 

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTA 819 



45 



55 



2) INFORMATION FOR SEQ ID NO: 209 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 822 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactlae 

(B) STRAIN: ATCC 13813 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 

328 



L 



CA 02307010 2000-05-19 



AGCTATCCTT GTAGTTGCTT CAACTGATGG ACCAATGCCA CAAACTCGTG 50 

AGCACATCCT TCTTTCACGT CAAGTTGGTG TTAAACACCT TATCGTATTC 100 

ATGAACAAAG TTGACCTTGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

5 AATGGAAATT CGTGACCTTC TTTCAGAATA CGACTTCCCA GGTGATGACC 2 00 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CACTTGAAGG CGATGAAAAA 2 50 

TACGAAGACA TCATCATGGA ATTGATGAGC ACTGTTGATG AGTACATTCC 3 00 

AG AACC AG AA CGTGATACTG ACAAACCTTT ACTTCTTCCA GTCGAAGATG 3 50 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 4 00 

10 GGTACTGTTC GTGTCAACGA CGAAGTTGAA ATCGTTGGTA TTAAAGAAGA 450 

TATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

AC G AAGGTCT TGCAGGGGAC AACGTTGGTG TTCTTCTTCG TGGTGTTCAA 550 

CGTGATGAAA TCGAACGTGG TCAAGTTCTT GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTTAAAG GTGAAGTTTA CATCCTTTCT AAAGAAGAAG 650 

15 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA ATTCTACTTC 700 

CGTACAACTG ACGTAACAGG TTCAATCGAA CTTCCAGCAG GAACAGAAAT 750 

GGTTATGCCT GGTGATAACG TTACTATCGA AGTTGAATTG ATTCACCCAA 800 

TCGCCGTAGA ACAAGGTACT AC 822 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 210 



( i ) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDCss-1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 210 



CGGAGCTATC CTTGTAGTTG CTTCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTA 100 

40 TTCATGAACA AAGTTGACCT TG TTGATG AT GAAGAATTGC TTGAATTGGT 15 0 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA AT AC G AC TTC CCAGGTGATG 2 00 

ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCACTTGA AGGCGACGAA 2 50 

AAATACGAAG ACATCATCAT GGAATTGATG AGCACTGTTG ATG AG TAC AT 3 00 

TCCAGAACCA GAACGTGATA C TG AC AAACC TTTACTTCTT CCAGTTGAAG 350 

45 ATGTATTCTC AATC AC TGG A CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTCGTGTCAA CGACGAAGTT GAAATCGTTG GTATTAAAGA 450 

AGATATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 5 00 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTT 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

50 CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT TCTAAAGAAG 65 0 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTACCGTCC ACAATTCTAC 7 00 

TTCCGTACAA CTGACGTAAC AGGTTCAATC GAACTTCCAG CAGGAACAGA 75 0 

AATGGTTATG CCTGGTGATA AC GTT ACTAT CGAAGTTGAA TTGATTCACC 800 

CAATCGCCGT AGAACAAGGT AC TAC 825 

55 

2 ) INFORMATION FOR SEQ ID NO : 211 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 
10 (B) STRAIN: ATCC 3 33 97 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CTCAAACTCG 50 

15 TGAACACATC CTTCTTTCAC GCCAAGTAGG TGTTAAATAC CTTATCGTCT 100 

TCATGAATAA AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT 150 

GAAATGGAAA TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA 2 00 

AATCCCAGTT ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATGAAA 2 50 

AATATGAAGA CATCATCATG GAATTGATGG AT AC TGTTG A TGAATACATT 3 00 

2 0 CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTTGAAGA 350 

TGTO-T^G-TGA ATTACTSGAC STGSTACTGT TGCTTCAGGA CGTATCGACC 400 

GTGGTACTGT TAAAGTCAAC GACGAAGTTG AAATCGTTGG TATCCGTGAT 450 

GAAATCCAAA AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAATT 500 

GGACGAAGGT CTTGCTGGAG ATAACGTAGG GGTTCTTCTT CGTGGTATCC 550 

2 5 AACGTG AC GA AATCGAACGT GGACAAGTTC TTGCTAAACC AGGTTCAATT 600 

CATCCACACA CTAAATTCAA AGGTGAAGTT TACATCCTTA CTAAAGAAGA 650 

AGGTGGACGT CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTACT 7 00 

TCCGTACTAC AGACGTTACA GGTTCAATCG AACTTCCTGC AGGTACTGAA 750 

ATGGTAATGC CTGGTGATAA CGTAACAATC GACGTTGAAT TGATCCACCC 800 

30 AATTGCCGTA GAACAAGGAA CTACAT 8 2 6 



2 ) INFORMATION FOR SEQ ID NO: 212 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus bovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

50 TGGTGCTATC CTTGTAGTAG CTTCTACAGA TGGTCCAATG CCACAAACAC 50 

GTGAACACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACCT TGTTGATGAC GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 2 00 

AAATCCCTGT AATCCAAGGT TCAGCTCTTA AAGCCCTTGA AGGTGACACT 250 

55 CACTACGAAG ACATCATCAT GGAATTGATG AACACTGTAG ATGAATACAT 3 00 

TCCAGAACCA AAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCATCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTCAA CGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

CGACATCCAA AAAGC TGTTG TTAC TGGTGT TGAAATGTTC CGTAAACAAC 500 

60 TTGATGAAGG TATCGCAGGG GATAACGTTG GTGTTCTTCT TCGTGGTATC 550 

330 



CA 02307010 2000-05-19 



CAACGTGATG AAATCGAACG TGGTCAAGTT 
CCACCCACAC ACTAAATTCA AAGGTGAAGT 
AAGGTGGACG TCACACTCCA TTCTTCAACA 
TTCCGTACAA CTGACGTTAC AGGTTCAATC 
5 AATGGTAATG CCTGGTGATA ACGTT AC TAT 
CAATCGCCGT TGAACAAGGT ACTACAT 



CTTGCTAAAC CAGGTTCAAT 600 
TTACATCCTT ACTAAAGAAG 650 
ACTACCGTCC TCAATTCTAC 700 
GAACTTCCAG CAGGTACTGA 7 50 

CGACGTTGAA TTGATTCACC 800 

827 



10 2 ) INFORMATION FOR SEQ ID NO: 213 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus (deposited as 
Streptococcus constellatus) 

(B) STRAIN: ATCC 27823 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

GCTATCCTCG TAGTAGCTTC AACTGATGGA CCAATGCCTC AAACTCGTGA 5 0 

ACATATCCTT CTTTCACGTC AAGTAGGTGT TAAATACCTT ATCGTCTTCA 100 

TGAACAAAGT TG AC TTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

3 0 ATGGAAATCC . GTGACCTTCT TTCAGAATAC GATTTCCCAG GTGATGAAAT . 2 00 

CCCAGTTATC CAAGGTTCAG CTCTTAAAGC TCTTGAAGGT GATGAAAAAT 25 0 

ATGAAGACAT CATCATGGAA TTGATGGATA CTGTTGATGA ATACATTCCA 3 00 

GAACCAGAAC GTGACACTGA CAAACCACTT CTTCTTCCAG TCGAAGATGT 3 50 

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 400 

3 5 GTACTGTTAA AGTCAATGAT GAAGTTGAAA TTGTTGGTAT TCGTGACGAA 450 

ATCCAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AACAATTGGA 500 

CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCGT GGTATCCAAC 55 0 

GTGACGAAAT CGAACGTGGA CAAGTTCTTG CTAAACCAGG TTCAATTCAT 600 

CCACACACTA AATTCAAAGG TGAAGTTTAC ATCCTTACTA AAGAAGAAGG 650 

40 TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA TTCTACTTCC 7 00 

GTACTACAGA CGTTACAGGT TCAATCGAAC TTCCTGCAGG TACTGAAATG 7 50. 

GTAATGC C TG GTGATAACGT AACAATTGAT GTTGAGTTGA TCCACCCAAT 800 

TGCCGTAGAA CAAGGAACTA C 821 



45 



2) INFORMATION FOR SEQ ID NO: 214 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cricetus 

(B) STRAIN: ATCC 19642 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 



GCTATCCTTG 
ACACATCTTG 
5 TGAACAAGGT 
ATGGAAATCC 
CCCTGTTGTT 
CCGAAGACAA 
GAACCAAAAC 

10 ATTCTCAATC 
GTACTGTTAA 
ATCCAAAAAG 
TGAAGGTCTT 
GTGATGAAAT 

15 CCACACACTA 
TGGACGTCAC 
GTACAACTGA 
GTTATGCCTG 
CGCTGTTGAA 

20 



TAGTAGCTTC 
CTTTCACGCC 
TGACTTGGTT 
GTGATCTTCT 
CAAGGTTCAG 
GATCATGGAA 
GTGATACTGA 
ACTGGACGTG 
GGTCAATGAC 
CGGTTGTTAC 
GCAGGGGATA 
CGAACGTGGT 
AATTCAAGGG 
ACTCCATTCT 
CGTAACTGGT 
GTGATAACGT 
AAAGGTACTA 



TACAGACGGA 
AAGTTGGTGT 
GACGATGAAG 
TTCAGAATAC 
CTCTTAAAGC 
TTGATGGACA 
TAAGCCATTG 
GTAC TGTTGC 
GAAGTTGAAA 
CGGAGTTGAA 
ACGTTGGTGT 
CAAGTATTGG 
TGAAGTTTAC 
TCAACAACTA 
TCAATCGAAT 
TACTATCGAC 
C 



CCAATGCCAC 
TAAGAGCCTT 
AATTGCTTGA 
GATTTCCCAG 
CCTTGAAGGT 
TCGTTGATGA 
CTTCTTCCAG 
TTCAGG AC GT 
TCGTTGGTAT 
ATGTTCCGTA 
GCTTCTTCGT 
CTGCACCTGG 
ATCCTTTCTA 
CCGTCCACAG 
TGCCAGCAGG 
GTTGAATTGA 



AAACTCGTGA 5 0 

ATCGTCTTCA 100 

ATTGGTTGAA 15 0 

GTGATGATAT 2 00 

GATACAGCTG 25 0 

CTACATTCCA 300 

TCGAAGACGT 350 

ATCGACCGTG 400 

CAAGGACGAA 450 

AACAATTGGA 500 

GGTATCCAAC 550 

TTCAATCCAT 600 

AAGATGAAGG 650 

TTCTACTTCC 7 00 

TACTGAAATG 750 

TCCACCCAAT 800 
821 



2 ) INFORMATION FOR SEQ ID NO : 215 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



~"(~i i ) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cristatus 
35 (B) STRAIN: ATCC 51100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

TATCCTTGTA GTAGCTTCAA C TG AC GG AC C AATGCCACAA ACTCGTGAGC 50 

40 ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTCTTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 200 

CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA TACTAAGTAC 25 0 

GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT ACATCCCAGA 3 00 

45 ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAGACGTAT 350 

TCTCAATCAC TGGTCGTGGT AC AGTTGC TT CAGGACGTAT CGACCGTGGT 400 

ACTGTTCGTG TCAACGATGA AATCGAAATC GTTGGTATCA AAGAAGAAAT 450 

C C AAAAAGC A GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAGCTTGACG 500 

AAGGTCTTGC AGGGGACAAC GTAGGTGTAC TTCTTCGTGG TATCCAACGT 550 

50 GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAGGGTG AAGTTTACAT C CTTAC T AAA GAAGAAGGTG 65 0 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT CTACTTCCGT 7 00 

ACAACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAATCG 800 

55 CCGTTGAACA AGGTACTCCT T 82 1 



2 ) INFORMATION FOR SEQ ID NO: 216 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus downel 

(B) STRAIN: ATCC 33748 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 

15 AGTAGCTTCT ACTGATGGAC CAATGCCACA AACTCGTGAA CACATCTTGC 50 

TTTCACGTCA GGTTGGTGTT AAGAACCTTA TCGTC TTC AT GAACAAGGTT 100 

GACTTGGTTG ACGATGAAGA ATTGCTTGAA TTGGTTGAAA TGGAAATCCG 15 0 

TGACCTGCTT TCAGAATACG ATTTCCCAGG TGATGATATC CCTGTTGTTC 2 00 

AAGGTTCAGC TCTTAAGGCT CTTGAAGGTG ATACAGCTGC CGAAGACAAG 250 

2 0 ATCATGGAAT TGATGGACAT CGTTGATGAC TACATTCCAG AACCAAAACG 3 00 

TGATACTGAT AAGCCTTTGC TTCTTCCAGT CGAAGATGTA TTCTCAATCA 350 

CTGGACGTGG TACTGTAGCT TCAGGACGTA TCGACCGTGG TACTGTTAAG 400 

GTCAACGACG AAGTTGAAAT CGTTGGTATC AAGG AC G AAA TCCAAAAAGC 450 

AGTTGTT AC C GGAGTTGAAA TGTTCCGTAA ACAATTGGAC GAAGGTCTTG 5 00 

25 CAGGGGATAA CGTTGGTGTG CTTCTTCGTG GTATCCAACG TGATGAAATC 550 

GAACGTGGTC AAGTGTTGGC TGCGCCTGGT TCGATTC AC C CACACACTAA 600 

GTTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT GGACGTCATA 65 0 

CTCCATTCTT TAACAACTAC CGTCCACAGT TCTACTTCCG TACAACTGAC 7 00 

GTAACTGGTT CAATCGAATT GCCAGCGGGT ACTGAAATGG TTATGCCTGG 7 50 

3 0 TGATAACGTT AC TATCG ACG TTGAATTGAT CCACCCAATT GC 792 



35 



2 ) INFORMATION FOR SEQ ID NO: 217 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 4307 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

50 GTAGTTGCTT CAACAGACGG AC C AATGCC A CAAACTCGTG AGCACATCCT 50 

CCTTTCACGT CAGGTTGGTG TTAAACACCT TATCGTGTTC ATGAACAAAA 100 

TTGACCTTGT TGACGATGAA GAATTGCTTG AATTGGTTGA AATGGAAATC 150 

CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC TTCCAGTTAT 2 00 

C C AAGGTTC A GCTCTTAAAG CTCTTGAAGG CGACACTAAA TTTGAAGACA 250 

55 TCATCATGGA ATTGATGGAT ACTGTTGATT CATACATTCC AGAACCAGAA 3 00 

CGTGACACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG TATTCTCAAT 350 

CACAGGTCGT GGTACAGTTG CTTCAGGACG TATCGACCGT GGTACTGTTC 400 

GTGTCAACGA CGAAATCGAA ATCGTTGGTA TCAAAGAAGA AACTAAAAAA 450 

GCTGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG ACGAAGGTCT 5 00 

60 TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 5 50 
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TCGAACGTGG TCAAGTTATT GCTAAACCAG 
AAATTCAAAG GTGAAGTATA TATCCTTTCT 
CACTCCATTC TTCAACAACT ATCGTCCACA 
ACGTAACAGG TTCAATCGAA CTTCCAGCTG 
5 GGTGATAACG TGACAATCAA CGTTGAGTTG 



GTTCAATCAA CCCACACACT 600 

AAAGACGAAG GTGGACGTCA 650 

ATTCTACTTC CGTACAACTG 7 00 

GTACAGAAAT GGTTATGCCT 7 50 

ATCCACCCAA TCGCC 7 95 



2 ) INFORMATION FOR SEQ ID NO : 218 



10 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Streptococcus egui subsp. egui 

(B) STRAIN : ATCC 952 6 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 

25 CGGAGCTATC CTTGTAGTTG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTTG GTGTTAAGCA CCTTATCGTG 100 

TTCATGAACA AGGTTGACCT TGTTGACGAT GAAGAATTGC TTGAGCTTGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 200 

ACCTTCCAGT TATCCAAGGT TCAGCGCTTA AGGCTCTTGA AGGCGACAGC 250 

30 AAATACGAAG ATATCATCAT GGAATTGATG GATACTGTTG ATTCATACAT 3 00 

- • - TCGAGAACC A GAACGTGACA C AG AC AAGC C ATTGCTTCTT CCAGTCGAGG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 400 

CGCGGTACTG TTCGTGTTAA CGACGAAATC G AAATC GTTG GTATCAGAGA 45 0 

CGAGATCAAA AAAGCAGTTG TTACTGGTGT CGAAATGTTC CGTAAACAGC 500 

35 TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT ATTGCTAAGC CAGGTTCTAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT AT AT ATC CTT AC TAAAG AAG 65 0 

AAGGTGGACG TCACACACCA TTCTTCAACA ACTATCGTCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC AGGTTCAATC GAGCTTCCAG CAGGTACAGA 75 0 

40 AATGGTTATG CCTGGTGATA ACGTG AC TAT TG AC GTTG AG TTGATCCACC 800 

CAATCGCCGT AGAACAAGGT ACTACATT 828 



45 2 ) INFORMATION FOR SEQ ID NO: 219 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 33477 



60 



( xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 219 
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CGGTGCAATC CTTGTAGTAG CTTCTACAGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAGGTAG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACTT GGTTG AC GAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TGCTTTCAGA ATATGATTTC CCAGGTGATG 200 

5 ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATACT 2 50 

GCTCAAGAAG ATGTTATCAT GGAATTGATG AAAACCGTTG ATGAGTACAT 3 00 

CCCAGAACCA GAACGTGATA C TG AC AAACC ATTGCTTCTT CCAGTCGAAG 3 50 

ATGTATTCTC AATCACAGGT CGTGGTACTG TAGCTTCAGG ACGTATC GAT 400 

CGTGGTACTG TAAGAGTCAA CGATGAAGTT GAAATCGTTG GTATCAAAGA 450 

10 CGAAATCACT AAAGCAGTTG TTACCGGTGT TGAAATGTTC CGTAAACAAT 5 00 

TGGACGAAGG TCTTGCTGGT GATAACGTTG GTGTGCTTCT CCGTGGTGTG 550 

CAACGTGATG AAATCGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650 

AAGGTGGACG TCATACACCA TTCTTCAACA ACTACCGTCC AC AGTTC TAC 700 

15 TTCCGTACAA CTGACGTAAC TGGTTCAATC G AATTGC C AG CAGGTACTGA 7 50 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTAC 825 



20 



30 



55 



2 ) INFORMATION FOR SEQ ID NO: 220 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 6 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: ATCC 10558 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 22 0 

CGGAGCTATC CTTGTAGTAG CTTCAACTGA TGGTCCTATG CCACAAACTC 50 

GTGAGCACAT CCTTCTCTCA CGCCAAGTTG GTGTTAAACA CTTGATCGTG 100 

TTCATGAACA AAGTTGACTT GGTTGACGAT GAAGAATTGC TTGAGTTGGT 150 

40 TGAAATGGAA ATCCGTGACC TCTTGTCAGA ATACGACTTC CCAGGTGACG 200 

ATCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCTCTTGA AGGTGACTCT 250 

AAATATGAAG ATATCATCAT GGAATTGATG AAC AC TGTTG ATGAGTACAT 3 00 

CCCAGAACCA GAACGCGACA CTGAC AAACC ATTGCTTCTT CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400 

45 CGTGGTATCG TTAAAGTCAA TGACGAAATC GAAATCGTTG GTATCAAAGA 450 

AGAAATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTGCTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG TGGACAAGTT ATTGCTAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTATATCCTT ACTAAAGAAG 650 

50 AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC AC AGTTC TAC 700 

TTCCGTACAA CTGACGTTAC AGGTTCAATC GAACTTCCAG CAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGACGTTGAG TTGATCCACC 800 

CAATCGCCGT TGAACAAGGT ACTACT 82 6 



2) INFORMATION FOR SEQ " ID NO: 221 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 799 bases 
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10 



30 



45 



(B) TYPE: Nucleic acid 
{ C ) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 



TGTAGTAGCT TCAACTGACG GACCAATGCC TCAAACTCGT GAACATATCC 50 

TTCTTTCACG TCAAGTAGGT GTTAAATACC TTATTGTCTT CATGAACAAA 100 

15 GTTGACTTGG TTGACGATGA AG AATTGC TT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT ATTCCAGTAA 200 

TCCAAGGTTC AGC AC TTAAA GCTCTTGAAG GTGATGAAAA ATATGAAGAC 2 50 

ATCATCATGG AATTGATGAA TACTGTTGAT GAATATATTC CAGAACCAGA 3 00 

ACGTGATACT GAC AAACC AT TGCTTCTTCC AGTCGAAGAT GTATTCTCAA 3 50 

2 0 TCACTGGACG TGGTACTGTT GCTTCAGGAC GTATCGACCG TGGTACTGTT 400 

AAAJGTCAACG ATGAAGTTGA AATGGTTSGT ATGCSCSAGG AAATCGAAAA 45 G 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAATTG GACGAAGGTC 500 

TTGCTGGAGA TAACGTAGGG GTTCTTCTTC GTGGTATCCA ACGTGACGAA 550 

ATTGAACGTG GACAAGTTCT TGCTAAACCA GGTTCAATTC ATCCACACAC 600 

2 5 TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 650 

ATACTCCATT CTTCAACAAC TACCGTCCTC AATTCTACTT CCGTACTACA 700 

GACGTTACAG GTTCAATCGA ACTTCCTGCA GGT AC TGAAA TGGTAATGCC 750 

TGGTGATAAC GTAACAATTG ATGTTGAGTT GATCCACCCA ATTGCCGTA 799 



2 ) INFORMATION FOR SEQ ID NO: 222 



( i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus macacae 

(B) STRAIN: ATCC 3 5911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 



TGGTGCTATT CTTGTAGTAG CTTCAACTGA CGGTCCAATG CCTCAAACGC 50 

GTGAACATAT CCTTCTTTCA CGCCAAGTAG GTGTTAAAAA CCTTATTGTT 100 

50 TTCATGAATA AAGTTGAC TT AGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTACAGA ATATGATTTC CCAGGCGATG 200 

AACTTCCAGT TATCCAAGGT TCAGCACTTA AAGCTCTTGA AGGTGATACT 250 

AAGTACGAAG ATATTATCAT GGAATTGTTG GATACTGTAG ATGATTACAT 3 00 

CCCAGAACCA CAACGTGATA CTGACAAGCC ATTGCTTCTT CCAGTCGAAG 350 

55 ATGTTTTCTC TATTACTGGA CGTGGTACTG TTGCTTCAGG ACGTATTGAC 400 

CGTGGTACTG TTAAGGTTAA TGATGAAGTT GAAATCGTTG GTATTCGTGA 450 

CGATATTCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTCG GTGTCCTTCT TCGTGGTATC 550 

CAACGTGATG AAATTGAACG CGGTCAAGTT CTTGCTAAAC CAGGATCAAT 600 

60 TCATCCACAT ACTAAATTCA AAGGTGAAGT TTATATTCTT AC TAAAG AAG 65 0 
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AAGGTGGACG TCATACTCCA 
TTCCGTACAA CTGATGTAAC 
AATGGTTATG CCTGGTGATA 
CAATCGCTGT TGAACAAGGT 



TTC TTTAAC A ACTACCGTCC 
TGGTTCAATT GATTTGCCAG 
ATGTTACGAT TGATGTTGAA 
ACAAC 



ACAGTTCTAC 7 00 

CAGGTACTGA 7 50 

CTGATCCACC 800 
825 



2 ) INFORMATION FOR SEQ ID NO : 223 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

15 

( i i ) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii (deposited as 
20 Streptococcus mitis) 

(B) STRAIN: ATCC 33399 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 223 

25 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GAC TTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 2 00 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 25 0 

30 CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATC CC AG 3 00 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 3 50 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

3 5 GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 7 00 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

40 TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC AAGGTACTAC AT 822 



45 2 ) INFORMATION FOR SEQ ID NO: 224 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

5 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

{ ii ) MOLECULE TYPE: Genomic DNA 



55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: ATCC 25175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 

60 
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CCCTGGTGCT ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGCCACAAA 5 0 

CTCGTGAACA CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 100 

GTCTTCATGA ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 15 0 

GGTTGAAATG GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 2 00 

5 ATGATATTCC AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 2 50 

ACTGCTCAAG AAGATATCAT CATGGAATTA ATGCATACTG TTGATGACTA 3 00 

CATTCCAGAT CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 350 

AAGATGTTTT CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 400 

GATCGTGGTA CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 45 0 

10 TGATGACATT CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 500 

AATTGGATGA AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 550 

ATCCAACGTG ATGAAATCGA ACGTGGTCAA GTTCTTGCTA AACCAGGTTC 600 

AATTCACCCA CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 650 

AGGAAGGTGG ACGTCATACA CCATTCTTCA ATAACTATCG TCCACAATTC 700 

15 TACTTCCGTA CAACTGACGT AAC TGGTTC A ATTGAGTTGC CAGCAGGTAC 750 

TGAAATGGTT ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 800 

ATCCAATCGC TGTTGAACAA GGTACTA 827 



20 



30 



55 



2 ) INFORMATION FOR SEQ ID NO : 225 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



( i i ) MOLECULE TYPE: Genomic DNA 
- (-yi-)-ORIGINALlSOURCE : 



(A) ORGANISM: Streptococcus parasangulnis 

(B) STRAIN: ATCC 15912 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATGCCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

40 AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTC CCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TG AC TCT AAA 250 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 300 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

45 GGTGTTGTTC GTGTC AATG A TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 450 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAAC CAG GTTCAATCAA 600 

CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

50 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 800 

TCGCCGTTGA ACAAGGTACT ACAT 824 



2 ) INFORMATION FOR SEQ ID NO : 22 6 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 824 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ratti 

(B) STRAIN: ATCC 19645 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA TGGACCAATG CCGCAAACTC 5 0 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

15 TTCATGAACA AGGTTG AC TT GGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTTCAGA ATACGATTTC CCAGGTGATG 2 00 

ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA AGGTGACACT 250 

GAACAAGAAG ATGTTATCAT GG AATTGATG AAAACAGTTG ATGAGTACAT 3 00 

CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT CCAGTCGAAG 3 50 

2 0 ACGTGTTCTC AATCACTGGA CGTGGTACTG TTGCATCAGG ACGTATCGAC 400 

CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAATCGTTG GTATCCGTGA 45 0 

TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 500 

TTGACGAAGG TCTTGCTGGT GATAACGTTG GTGTACTTCT TCGTGGTATC 550 

CAACGTGATG AAATCGAACG CGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

25 TCATCCGCAT ACTAAATTTA AAGGTGAAGT TTACATCCTT AC TAAAG AAG 650 

AAGGCGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 7 00 

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGC C AG CAGGTACTGA 7 50 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 800 

CAATCGCTGT TGAACAAGGT ACTA 824 

30 



2 ) INFORMATION FOR SEQ ID NO : 227 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

{ C ) STRANDEDNESS : Double 
{ D ) TOPOLOGY : Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sanguinis 
45 (B) STRAIN: ATCC 10556 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

TGTAGTAGCT TCAACTGACG GACCAATGCC ACAAACTCGT GAGCACATCT 50 

50 TGCTTTCACG TCAGGTTGGT GTTAAACACT TGATCGTCTT CATGAACAAA 100 

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGACCTC TTGTCAGAAT ACGACTTCCC AGGTGACGAT CTTCCAGTTA 2 00 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA ATATGAAGAC 250 

ATCATCATGG AATTGATGGA C AC TGTTG AT GAGTACATCC CAGAACCAGA 3 00 

55 ACGCGATACT GACAAGCCAT TGCTTCTTCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACAGTT GCTTCAGGAC GTATCGACCG TGGTATCGTT 400 

AAAGTCAACG ACGAAATCGA AATCGTTGGT ATCAAAGAAG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAGCTT GACGAAGGTC 500 

TTGCAGGGGA CAACGTAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 550 

60 ATCGAACGTG GACAAGTTAT CGCTAAACCA GGTTCAATCA ACCCACACAC 600 
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TAAATTCAAG GGTGAAGTTT ATATCCTTAC 
ACACTCCATT CTTCAACAAC TACCGTCCAC 
GACGTTACAG GTTCAATCGA ACTTCCAGCA 
TGGTGATAAC GTAACAATCG ACGTTGAGTT 



TAAAGAAGAA GGCGGACGTC 650 

AGTTCTACTT CCGTACAACT 700 

GGT AC TG AAA TGGTAATGCC 750 

GATCCACCCA ATCGC 79 5 



2) INFORMATION FOR SEQ ID NO: 228 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sobrinus 
2 0 (B) STRAIN: ATCC 33478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 8 

TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 50 

25 TGCTTTCACG CCAAGTTGGT GTTAAGAACC TCATCGTCTT CATGAACAAG 100 

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTG AC G AC ATTCCTGTTG 2 00 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGCCGAAGAC 2 50 

AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAACCAAA 3 00 

30 ACGCGATACT GATAAGCCAT TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 3 50 

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 400 

AAGGTTAACG AC G AAGTTGA AATCGTTGGT ATCCGTGACG ATATCCAAAA 450 

AGCAGTTGTT AC TGG AGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 5 00 

TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 550 

35 ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 600 

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 650 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 7 00 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 750 

TGGTGATAAC GTTAC TATCG ACGTTGAATT GATCCACCCA ATCGC 7 95 

40 



2) INFORMATION FOR SEQ ID NO: 229 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 
55 (B) STRAIN: ATCC 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50 
60 TTCTTTCACG TCAGGTTGGT GTTAAACACC TTATCGTCTT CATGAACAAA 100 
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GTTGACTTGG 
CCGTGACCTT 
TCCAAGGTTC 
ATCGTTATGG 
5 ACGCGACACT 
TCACTGGTCG 
CGTGTCAACG 
AGCAGTTGTT 
TTGCCGGCGA 
10 ATCGAACGTG 
TAAATTCAAA 
ACACTCCATT 
GACGTAACTG 
TGGTGATAAC 

15 



TTGACGATGA 
CTTTCAGAAT 
AGC TCTT AAA 
AATTGATGAA 
GAC AAAC CAT 
TGGTACTGTA 
ACGAAATCGA 
ACTGGTGTTG 
TAACGTTGGT 
GTCAAGTTAT 
GGTGAAGTTT 
CTTCGACAAC 
GTTCAATCAA 
GTTACTATCG 



AGAATTGCTT 
ACGATTTCCC 
GCTCTTGAAG 
CACTGTTGAT 
TGTTGCTTCC 
GCTTCAGGAC 
AATCGTTGGT 
AAATGTTCCG 
GTGC TTCTTC 
CTCTAAACCA 
ACATCCTTAC 
TACCGTCCAC 
ATTGCCAGAA 
ACGTTGAATT 



GAGTTGGTTG 
AGGTGATGAT 
GTGACTCTAA 
GAGTACATTC 
AGTCGAGGAC 
GTATCGACCG 
CTTCAAGAAG 
T AAAC AAC TT 
GTGGTGTACA 
GGTTCTATCA 
TAAAGAAGAA 
AGTTCTACTT 
GGTAC TG AAA 
GATCCACCCA 



AAATGG AAAT 15 0 

CTTCCAGTTA 2 00 

GTACGAAGAC 2 50 

CAGAACCAGA 3 00 

GTATTCTCAA 3 50 

TGGTACTGTT 400 

AAAAATCTAA 45 0 

GAC GAAGGTC 500 

ACGTGATGAA 55 0 

ACCCACACAC 600 

GGTGGACGTC 650 

CCGTACAACT 7 00 

TGGTAATGCC 75 0 

ATCGCCG 797 



2 ) INFORMATION FOR SEQ ID NO: 23 0 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNE SS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uberls 
3 0 (B) STRAIN: ATCC 1943 6 

(xi-)-SEQUENGE~ DESCRIPTION : • SEQ ID -NO :- 2 3 0 - - 



TTGTTGTTGC ATCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC 50 

3 5 CTTCTTTCAC GCCAAGTTGG TGTTAAACAC CTTATCGTTT TCATGAACAA 100 

AATCGACCTT GTTGACGATG AAGAATTGCT TGAATTAGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTACCAGTT 200 

ATCCAAGGTT C AGC TCTT AA AGC TC T TG AA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GAATTGATGA AAACTGTTGA TGAGTATATT CC AG AAC C AG 300 

40 AACGTGATAC AGACAAACCA TTACTTCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACAGGTC GTGGTACTGT AGCTTCAGGA CGTATCGATC GTGGT AC TGT 400 

TCGTGTCAAC GAC G AAATTG AAATCGTTGG TATCAAAGAA GAAACTAAAA 45 0 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCAGGAG ATAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA 550 

45 AATCGAACGT GGACAAGTTA TTGCTAAACC AGGTTCAATC AACCCACACA 600 

C TAAATTC AA AGGTGAAGTT TACATCCTTT CTAAAGATGA AGGTGGACGT 650 

CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCAGC TGGT AC TGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC AGCGTTGAGT TGATCCACCC AAT 793 

50 



2 ) INFORMATION FOR SEQ ID NO : 2 31 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus vestibularis 
5 (B) STRAIN: ATCC 49124 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 231 

TTGTAGTAGC ATCTACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

10 CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT GAAATGGAAA 15 0 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA TATTCCAGTT 2 00 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 2 50 

CATCATCATG GACTTGATGA AC AC TGTTG A CGAATACATT CCAGAACCAG 3 00 

15 AACGTGACAC TGACAAACCA TTGTTGCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACTGGTC GTGGTACTGT TGCTTCAGGA CGTATCGACC GTGGTGTTGT 400 

TCGTGTTAAT G AC GAAGTTG AAATCGTTGG TCTTAAAGAA G AAATC C AAA 450 

AAGCAGTTGT T AC TGGTGT A GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

ATTGCCGGAG ATAACGTCGG TGTCCTTCTT CGTGGTATCC AACGTGATGA 55 0 

20 AATTGAACGT GGTCAAGTAT TGGCTGCACC TGGTTCAATC AACCCACACA 600 

CTAAATTCAA AG'GTGAAGTT TACATCCTTT CTAAAGAAGA AGGTGGACGT 65 0 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAGTTCTACT TCCGTACAAC 7 00 

TGAC GTAAC A GGTTCAATCG AACTTCCTGC AGGTACTGAA ATGGTTATGC 750 

CTGGTGATAA CGTGACTATC GACGTTGAGT TGATCCACCC AATCGCCG 798 

25 

2 ) INFORMATION FOR SEQ ID NO: 232 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) * LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 
40 (B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 232 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

45 TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 200 

CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG 250 

AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 300 

50 CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 400 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 45 0 

■ ACTGCGAAAT CAACCTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

55 GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 

CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 65 0 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 75 0 

GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 800 

60 CGCGATGGAC GATGGTCTGC GTTTCGCAA 82 9 

342 
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2 ) INFORMATION FOR SEQ ID NO: 23 3 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trabulsiella guamensis 

(B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

20 GGCGCAATCC TGGTAGTAGC AGCGAC TG AC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGATGA 2 00 

CACGCCGATC GTACGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 2 50 

2 5 AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTTACATT 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCTGGTC GTGGTACCGT TGTTAC CGGT CGTGTAGAGC 400 

GCGGTATCAT CAAAGTGGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CAACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 5 00 

3 0 CGAAGGCCGT GC TGGTG AG A ACGTAGGTGT TCTGCTGCGT GGTATC AAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

3 5 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 82 9 



40 2) INFORMATION FOR SEQ ID NO: 234 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Velllonella parvula 

(B) STRAIN: ATCC 10790 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



CGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCCCTATG CCTCAAACTC 50 

GCGAACACAT CTTGTTGGCT CGCCAAGTTG GTGTTCCTGC AATCGTAGTA 100 

TTCTTGAACA AAGCTGACAT GGTTGACGAT GAAGAATTGA TCGAATTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTTC CTACGAATTC CCTGGCGACG 2 00 

60 AAGTAC CT AT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 2 50 

343 
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CAATATGTAG CTAAAATTGA CGAATTGATG GACGCTGTAG AC TC C T AC AT 3 00 

CCCAACACCA GTTCGTGACA CTGATAAACC ATTCTTGATG CCTGTGGAAG 350 

ATGTTTTCAC AATCACTGGT CGTGGTACAG TAGCAACTGG CCGTGTTGAA 400 

CGTGGTCAAG TAAACGTTGG TGATACTGTT GAAGTAGTAG GCTTGAAAGA 450 

5 AAAAGC TGAA CAATACGTAG TAACAGGTCT TGAAATGTTC CGTAAAGTGT 500 

TGGATTCTGC AGTAGCAGGT GACAACGTAG GTGCATTGCT TCGTGGTGTT 55 0 

GATCGTAAAG ACATYGAACG TGGTCAAGTA TTGGCTAAAC CAGGTTCCAT 600 

CAACCCACAY ACAAAATTCA AAGCAGAAGT ATACGTATTG ACTAAAGAAG 65 0 

AAGGTGGTCG TCATACTCCA TTCTTCTCCA ACTACCGTCC AC AATTC TAC 700 

10 TTCCGTACAA CAGACGTAAC AGGTGTTGTA AACCTTCCTG AAGGTGTAGA 750 

AATGTGTATG CCTGGCGATA ACGTAACAAT GGAAATCGAA TTGATTACTC 800 

CAATCGCTAT CGAAGAAGGT CTTCG 825 



15 



25 



50 



2 ) INFORMATION FOR SEQ ID NO: 23 5 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 5 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY*: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitlca 

(B) STRAIN: ATCC 9610 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

CGCTATCCTG - "GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

ATGAACAAAT GTGACATGGT TGACGATGAA G AGC TGC TAG AACTGGTAGA 150 

35 AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 2 00 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 250 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TAC CTGGATT CTTACATCCC 300 

AGAACCAGAG CGTGCTATCG ATAAGC CGTT CCTGCTGCCA ATCGAAGACG 350 

TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 400 

40 GGTATCGTTA AAGTTGGTGA AGAAGTCGAA ATTGTTGGCC TGAAAGATAC 450 

CGTT AAATC T ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 500 

AAGGCCGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 600 

ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 6 50 

45 GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 7 00 

ACAACTGATG TAACCGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC 825 



2 ) INFORMATION FOR SEQ ID NO : 236 



( x ) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid' 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DN A 

344 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 236 

GGCGCGATCC TGGTTGTTGC TGCCACTGAT GGCCCGATGC CACAGACTCG 50 

CGAGCACATT CTGTTAGGGC GTCAGGTGGG TGTTCCTTAC ATCCTGGTCT 100 

10 TCCTGAACAA ATGTGACATG GTTGACGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CTGGCGACGA 2 00 

CACTCCAGTT ATCCGTGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCTG 2 50 

AGTGGGAAGC AAAAATCATC GAATTGGCTG AGGCGCTGGA TAGCTATATT 3 00 

CCACAGCCAG AGCGTGCGAT TGATAAACCA TTCCTGCTGC CAATCGAAGA 3 50 

15 CGTATTC TC A ATCTCTGGCC GTGGTACTGT TGTC AC CGGT CGTGTAGAGC 400 

GCGGTATCGT TAAAGTCGGC GAAGAAGTCG AAATCGTTGG TATCATTGAT 450 

ACCATCAAGA CTACCTGTAC TGGTGTTGAA ATGTTCCGCA AATTGCTGGA 5 00 

CGAAGGCCGT GCGGGTGAGA ACGTTGGTGT TCTGCTACGT GGTACTAAAC 550 

GTGATGACGT AC AAC GTGGT CAGGTATTGG CAAAACCAGG TTCTATCAAG 600 

2 0 CCACACACCA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

TGGTCGCCAT ACTCCGTTCT TCAAAGGTTA TCGTCCTCAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 7 50 

GTGATGCCAG GTG AT AAC AT TCAAATGATT GTTAACCTGA TTGCTCCTAT 800 

CGCAATGGAT GACGGTCTGC GCTTTGCG 828 

25 



2 ) INFORMATION FOR SEQ ID NO : 2 37 

3 0 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
40 (B) STRAIN: ATCC 29909 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 50 

45 TCCTGCTAGG TCGTCAGGTG GGTGTTC CTT ACATCCTGGT CTTCCTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 150 

AGTCCGCGAA CTTCTGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG 2 00 

TTATCCGTGG TTCAGCGCTG AAGGCGTTGG AAGGCGAGCC TGAATGGGAA 2 50 

GCAAAAATTA TCGAATTAGC TGAGGCGCTG GATAGTTATA TTCCACAGCC 300 

5 0 AGAGCGCGCG ATTGATAGAC CATTCTTGCT GCCAATCGAA GACGTATTCT 3 50 

CTATCTCAGG TCGTGGTACA GTCGTCACTG GTCGTGTAGA GCGTGGGATC 400 

GTTAAAGTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 450 

AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG G AC GAAGGC C 5 00 

GCGCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 550 

55 GTACAGCGTG GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGC C AC AT AC 6 00 

TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 650 

ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 7 00 

GATGTAACCG GTACTATTGA ATTGCCAGAC GGCGTTGAGA TGGTGATGCC 7 50 

AGGTGATAAC ATTCAAATGA TTGTTAACCT GATTGCACCT ATTGCGATGG 800 

60 ATGATGGTCT GCG 813 

345 
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2 ) INFORMATION FOR SEQ ID NO : 23 8 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

2 0 GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC CGCAGACTCG 50 

TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCC CAT AC ATTATTGTCT 100 

TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT AGAGTTGGTT 150 

GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC CAGGCGACGA 2 00 

CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA GGTGACGCTG 2 50 

2 5 AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA TAGCTATATT 3 00 

CCGCAACCAG AACGCGCTAT TGATAGACCA TTCCTATTGC CAATTGAAGA 3 50 

CGTATTCTCT ATTTCTGGTC GTGGTAC TGT AGTTACTGGT CGTGTAGAAC 400 

GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG TATTATCGAT 450 

ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA AGCTGCTGGA 500 

3 0 TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG TTCTATCAAG 600 

CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 65 0 

CGGCCGTCAT ACACCGTTCT TCAAGGGCTA CCGTCCTCAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG CGTTGAAATG 750 

35 GTCATGCCTG GTGACAACGT AAACATGGTT GTTAACCTAA TTGCTCCTAT 800 

CGCAATGGAT GATGGTCTGC GCTTCGCAA 829 



40 2 ) INFORMATION FOR SEQ ID NO: 239 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

( D ) TO POLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 



TGGAGCGATC TTGGTTGTTG CTGCAACCGA TGGCCCTATG CCGCAGACTC 50 

GTGAGCATAT CCTGCTGGGC CGCCAGGTTG GTGTCCCATA CATTATTGTC 100 

TTCCTGAACA AATGTGACAT GGTTGACGAT GAAGAGTTGC TAGAGTTGGT 150 

TGAAATGGAG GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCAGGCGACG 2 00 

60 ACACTCCAGT CATCCGTGGT TCAGCGTTGA AAGCCCTGGA AGGTGACGCT 250 

346 
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GAGTGGGAAG CTAAAATTAT 
TCCGCAACCA GAACGCGCTA 
ACGTATTCTC TATTTCTGGT 
CGCGGTATTG TTAAGGTCGG 
5 TAC G ATT AAA ACAACTTGTA 
ATGAAGGCCG TGCTGGTGAA 
CGTGACGATG TTCAGCGTGG 
GCCACACACG AAGTTTGAGT 
GCGGCCGTCA TACACCGTTC 
10 CGTACAACTG ACGTGACCGG 
GGTCATGCCT GGTGACAACG 
TCGCAATGGA TGATGGT 



CGAGTTGGCA GAAGCTCTGG 
TTGATAGACC ATTCCTATTG 
CGTGGTACTG TAGTTACTGG 
CGAAGAAGTT G AAATC GTTG 
CTGGCGTTGA AATGTTCCGC 
AATGTTGGTG TTCTGCTGCG 
TCAAGTACTG GCGAAACCAG 
CAGAAGTTTA TATTCTGAGC 
TTCAAGGGCT ACCGTCCTCA 
TACCATTGAG CTGCCAGAAG 
TAAACATGGT TGTTAACCTA 



ATAGCTATAT 3 00 

CCAATTGAAG 3 5 0 

TCGTGTAGAA 400 

GTATTATCGA 4 50 

AAGCTGCTGG 500 

TGGTACTAAG 550 

GTTCTATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 7 00 

GCGTTGAAAT 750 

ATTGCTCCTA 800 
817 



15 



25 



50 



2 ) INFORMATION FOR SEQ ID NO: 240 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 829 bases 
20 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240 

'TCGCGCGATC CTGGTTGTTG CTC^AACTGA "TGGCCtAATG CCACAGACTC 50 

GCGAGCACAT CCTGTTGGGT CGTCAAGTGG GTGTTCCTTA CATCTTAGTC 100 

TTCCTGAACA AGTGTGACAT GGTTGACGAC GAAGAGTTGC TGGAACTGGT 150 

3 5 TGAAATGGAA GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCTGGCGATG 200 

ACACTCCGGT TATTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAGGCC 250 

GAGTGGGAAG CCAAAATTAT TGAACTTGCT GAAGCACTGG ATAGCTACAT 3 00 

TCCACAGCCA GAGCGCGCGA TTGATAAACC ATTCTTGCTG CCAATCGAAG 3 50 

ACGTATTCTC TATCTCAGGC CGTGGAACAG TTGTTACCGG GCGTGTTGAG 400 

40 CGCGGTATCG TCAGAGTGGG CGAAGAAGTT GAAATCGTGG GTATCATCGA 450 

CACCATTAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTTGGTG TTCTGCTGCG CGGTACTAAA 550 

CGCGATGACG TGCAACGTGG TCAAGTGTTG GCTAAACCAG GTTCTATTAA 600 

GCCGCATACC AAATTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

45 GTGGTCGTCA TACTCCGTTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC ' 7 00 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGACG GTGTTGAGAT 750 

GGTGATGCCA GGTGATAACA TTCAAATGAT TGTTAACCTG ATTGCGCCTA 800 

TTGCAATGGA TGACGGTCTA CGATTTGCA 829 



2 ) INFORMATION FOR SEQ ID NO : 241 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

347 



CA 02307010 2000-05-19 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 5 0 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GG TTGATG AC GAAGAGCTGC TGGAACTGGT 15 0 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 20 0 

ACACTCCGAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 250 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 350 

15 ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 400 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT G AAATC GTTG GTATCAAAGA 450 

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

C GTG AAG AAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 600 

2 0 GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACAACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 8 00 

TCGC 804 

25 



2 ) INFORMATION FOR SEQ ID NO: 242 

30 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 9 bases 

(B) TYPE :~ Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp . 
4 0 denitrificans 

(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

45 TCAGTTCCCC CGCGATCACA TG CC C AAG AT CTACGAAGCG CTTACTCTGG 5 0 

CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GG AAGTGCAG 100 

CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 150 

CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 2 00 

TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 250 

50 CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 3 00 

CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 3 50 

AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 4 00 

AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 450 

GGAACTGATC AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 500 

55 CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG 55 0 

GAAGAGTCGA AC GTTC TGG A CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 600 

CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 650 

C C GAG AAGTT CCGCGACGAA GGCCGCGACA TCCTGTTCTT CGTGGACAAC 7 00 

ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 750 

60 TATGCCGTCG GCAGTGGGCT ACCAGCCCAC GCTGGCCGAA GAAATGGGCA 800 

348 
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AGCTGCAAGA GCGCATCACC TCGACCAAGA CCGGCTCGAT CACCTCGAT 84 9 



5 2 ) INFORMATION FOR SEQ ID NO: 243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 87 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 43 



20 



TGAAACTACT TTAGAAGTTC AGCAACAACT TGGTGATGGT GTTGTTCGTA 50 

CCATCGCAAT GGGTTCTACA GAAGGTCTTA AACGTGGTCT T AC TGTAAC T 100 

AGCACAAACG CACCGATCTC TGTTCCAGTT GGTACAGCCA CTCTTGGCCG 15 0 

TATCATGGAC GTTTTAGGTC GTCCTATCGA TGAAGCAGGT CCTGTTGCGA 2 00 

2 5 CTGAAGAACG TTTGCCGATT CACCGTCAAG CGCCTTCTTA TGCTGAACAA 250 

GCAGCTTCTA CTGACCTTTT AGAAAC TGGT ATTAAAGTCA TC G AC TT AC T 3 00 

TTGCCCGTTT GCGAAAGGTG GTAAAGTTGG TTTATTCGGT GGTGCTGGTG 3 50 

TTGGTAAAAC CGTTAACATG ATGGAATTGA TCAACAACAT CGCGAAAGCA 4 00 

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGTGAGCGTA CTCGTGAAGG 450 

3 0 TAATGACTTC TATCACGAAA TGAAAGATTC TAACGTTCTT GACAAAGTAG 500 

CAATGGTCTA CGGTCAGATG AACGAGCCAC CAGGTAACCG TTTACGCGTA 550 

GCGTTAACTG GTTTG AC TAT GGCTGAATAC TTCCGTGATG AAAAAGACGA 600 

AAACGGTAAA GGTCGTGACG TATTATTATT CGTCGACAAC ATCTACCGTT 650 

ATACACTTGC AGGTACTGAA G T ATC AG CAT TGTTAGGTCG TATGCCATCT 7 00 

3 5 GCGGTAGGTT ACCAACCTAC AC TTGC AG AA GAAATGGGTG TTCTTCAAGA 7 50 

GCGTATTACA TCTACTAAAT CTGGTTCGAT CACTTCG 7 87 



40 2 ) INFORMATION FOR SEQ ID NO: 244 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 825 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

50 { vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetojbacter Iwoffil 
<B) STRAIN: CDCF 3697 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 



55 



GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGC AC TG AAACTACATT 50 

AGAAGTTCAG CAACAACTTG GTGATGGCGT AGTTCGTACT ATTGCAATGG 100 

GTTCTACTGA AGGCCTTAAG CGTGGTTTGA ACGTAACTAA CACTAACGCG 150 

CCGATTTCTG TACCAGTAGG TACAGCGACT CTAGGTCGTA TCATGGACGT 2 00 

60 TCTTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 2 50 

349 
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TGCCGATTCA CCGTCAAGCA CCTTCTTATG CTGAACAAGC AGCTTCTACT 3 00 

GACCTTTTAG AAACTGGTAT TAAAGTCATC GACTTACTTT GCCCGTTCGC 350 

TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG TGCCGGTGTT GGTAAAACTG 400 

TAAACATGAT GGAGTTGATC AACAACATCG CTAAAGCGCA CTCAGGTTTA 450 

5 TCTGTATTCG CTGGTGTTGG TGAGCGTACT CGTGAAGGTA ATGAC TTCT A 5 00 

TCACGAGATG AAAGACTCAA ACGTTCTAGA CAAAGTAGCA ATGGTCTACG 550 

GTCAGATGAA CGAGCCACCG GGTAACCGTT TACGCGTAGC GTTGACTGGT 600 

TTGACCATGG CTGAGTACTT CCGTGACGAG AAAGACGAAA ACGGCAAAGG 650 

CCGTGACGTA CTATTGTTCG TAGATAACAT CTAC CGTTAT ACACTAGCAG 700 

10 GTACTGAAGT ATCAGCACTT CTAGGTCGTA TGCCGTCTGC AGTAGGTTAC 750 

CAACCGACAC TTGCAGAAGA GATGGGTGTT CTTCAAGAAC GTATTACATC 800 

GACTAAGTCT GGTTC G ATT A CGTCA 82 5 



15 



25 



50 



2 ) INFORMATION FOR SEQ ID NO : 245 



{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 837 bases 
20 (B) TYPE: Nucleic acid 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG-197 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 

CAATGAAGTT C C AG AAATT A ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTC GTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 15 0 

35 TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATAC TTCTGT CAAACGTGAT CCAATTCATA G AGAAGC AC C 3 00 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 35 0 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

40 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 45 0 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AAC GTACGC G TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 55 0 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

45 GTGATGTACA AGGACAAGAT GTTTTAC TAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GT ATGC CATC 75 0 

AGC CGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AAC GT ATT AC ATCAACAACT AAAGGATCTG TAACGTC 837 



2) INFORMATION FOR SEQ ID NO: 246 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

350 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis 

(B) STRAIN: ATCC 15554 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 

TTCCCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 50 

CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTTGAA GT AC AG C AAC 100 

10 AATTGGGTGA CGGTGTAGTT CGCACCATCG CCATGGGTTC CAGCGACGGC 150 

CTGCGCCGCG GTATGGAAGT GGCCGGTTCG GGCGCTCCCA TCTCCGTTCC 2 00 

CGTGGGTGTC GGCACCCTGG GTCGCATTAT GGACGTTCTG GGTCGCCCTA 2 50 

TTGACGAAGT CGGTCCTATT CAGTCCGACG AGCGTCGCGC CATTCACCAG 3 00 

CCTGCGCCTA CTTTCGACGA ACTGTCGCCT TCCGTAGAGC TGCTGGAAAC 350 

15 C GGT ATT AAA GTGATTGACC TGGTTTGCCC GTTCGCCAAG GGTGGTAAGG 400 

TTGGTCTGTT CGGTGGTGCC GGTGTGGGCA AGACCGTGAA CATGCTGGAG 450 

CTGATCAACA ACATCGCCAA GGCACACAGC GGTCTGTCCG TGTTTGCCGG 5 00 

TGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGGCCG 550 

ATGCTGGCGT TATCC AGATG GACAACCTGA GCGAGTCCAA AGTGGCCATG 600 

2 0 GTGTTCGGTC AGATG AACGA ACCTCCAGGC AACCGTCTGC GTGTGGCACT 650 

GTCCGGCCTG ACCATGGCCG AGAAGTTCCG TGACGAAGGC CGTGACATCC 7 00 

TGTTCTTTGT GGACAACATC T AC CGCTAC A CGCTGGCCGG TACAGAAGTG 750 

TCCGCTCTGC TGGGTCGTAT GCCTTCCGCA GTGGGTTACC AGCCTACGCT 800 

GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG 850 

25 G 851 



30 



2) INFORMATION FOR SEQ ID NO: 247 



( i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Bacillus anthracls 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 

45 GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 2 00 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 2 50 

50 AC TTAG ATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 3 00 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAG TAG AAA TTCTTGAAAC 3 50 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450 

TTAATCAATA ACATCGCACA AGAAC AC GGT GGTATCTCTG TATTCGCTGG 5 00 

55 TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATC GATAAT A 7 00 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

60 ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 

351 
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ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTCT 846 



5 2 ) INFORMATION FOR SEQ ID NO: 248 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 



20 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 



CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG A&AACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 15 0 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 2 00 

25 AC GTGTATTC AAC GT ATT AG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

CTGCGGATGT AC AC C G TG AT CCAATTCACC GTCAAGCACC TGCATTCGAA 3 00 

GAATTATCTA CTAAAGTAGA AATTCTTGAA AC TGGTATTA AAGTAGTAGA 3 50 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AGTTAATCAA TAACATCGCA 45 0 

3 0 CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

_TGAGGGTAAT._GACTTATACC ACG AAATG AG CGATTCTGGC GTAATCAAGA 5 50 

AAACTGCGAT GGTATTCGGA C AAATG AACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AAC AATGGC T GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATC TTC CGT TTCACGCAAG 700 

35 CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GT ATGC CATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AG CGT ATT AC 800 

ATCTACAAAT 810 



2 ) INFORMATION FOR SEQ ID NO: 249 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 944 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 . 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 

CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCGT ATCCACGACG 50 

CCATGGAGAT TTCCCGCCCA AACGGAAAGA TCTTGATCGT CGAAGTTCAG 100 

CAACACATCG GGGAAAACAC CGTCCGTACC GTAGCGATGG ATACGACCGA 150 

60 CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 200 
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TGCCGACCGG CGACCAAGTC AAAGGACGTT TAATGAATGT CACCGGCGAC 2 50 

CCTATCGATG GCATGGCCCA GCTTACTAAA GACGGGGCTC TTCCCATCCA 3 00 

TCGTGAGCCT CCTAAATTCG AGGATCTGAC AACGACCCAA GAGGTCTTGT 350 

ACACGGGTAT TAAAGTAATC GATTTATTGG AACCTTACGC CAAAGGAGGC 4 00 

5 AAGATCGGAC TTTTCGGAGG AGC CGGAGTC GGCAAAACGG TATTGATCAT 450 

GGAATTGATC AACAACATCG CAAAGAAAAA CAACGGATTC TCCGTCTTCG 500 

CCGGTGTGGG TGAACGTACA CGTGAAGGAA ATGACCTATT GCGTGAAATG 550 

ATCCAATCCG GTGTCATCCG ATACGGCGAG GAGTTTAAGA AGAGTATGGA 600 

GGCAGGCAAC TGGGACTTGT CGAAGATCGA TTACGATGAA TTGGCTAAGT 650 

10 CGCAGGCTAC CTTGGTATTC GGGCAGATGA ACGAACCGCC GGGCGCCCGC 700 

TCGTCCGTGG CCTTATCCGG TTTGACGATC GCCGAATCCT TCCGTGACAA 75 0 

AGCCTCTGAG GGAGAAAGAA AAGATATATT ATTCTTCATC GATAATATCT 800 

TCCGTTTCAC CCAAGCCGGT TCCGAGGTTT CCGCCTTGCT GGGGCGTATG 850 

CCTTCCGCCG T AGGTTAC C A ACCGACATTG GCCACGGAAA TGGGAGCTAT 900 

15 GCAAGAGCGT ATCACTTCAA CCAAGAAGGG CTCCATCACC TCCG 944 



20 



2 ) INFORMATION FOR SEQ ID NO: 250 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Bacteroides ovatus 

(B) STRAIN: ATCC 8483 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

3 5 TTGAGGGTAC GGATGCAGAA TTGGTGCTGC CAAGCATCCA CGACGCACTG 50 

GAGATAAAGA GGCCAAACGG CAAAATACTG GTTGTAGAAG TTCAGCAACA 100 

TATCGGCGAA AATACGGTGC GTACCGTAGC GATGGACAGT ACTGACGGAC 150 

TTCAGAGAGG CATGAAAGTG TATCCCACCG GAGGCCCGAT CACGATGCCG 2 00 

ATTGG CGAAC AGATTAAAGG ACGACTGATG AACGTAGTCG GTGATTCGAT 250 

40 CGACGGTATG AAAGGACTCG ACCG CAAAGG TGCATATTCC ATTCATCGCG 3 00 

ACCCCCCTAA GTTTGAGGAT TTGACTACTG TGCAAGAGGT GC TCTTC AC A 350 

GGTATCAAAG TGATCGACCT GCTCGAACCG TATGCCAAAG GTGGTAAAAT 400 

CGGTTTGTTC GGCGGTGCCG GTGTAGGAAA GACTGTATTG ATTCAGGAAC 450 

TTATCAATAA TATCGCCAAG AAACATAATG GATTCTCTGT ATTTGCCGGA 5 00 

45 GTAGGTGAAC GTACCCGTGA AGGTAACGAC TTGCTGCGCG AAATGATTGA 550 

ATCCGGTGTA ATCCGTTACG GCGAAGCATT CAAAGAAGGA ATGGAGAAAG 600 

GTCACTGGGA TCTTTCGAAA GTGGATTATA ACGAACTGGA GAAATCGCAA 650 

GTGTCTCTGA TTTTCGGTCA GATGAACGAG CCTCCGGGCG CACGTGCCTC 7 00 

TGTGGCATTG TCCGGACTGA CGGTGGCGGA ATCTTTCCGC GACGCAGGAA 7 50 

50 AAGAAGGTGA GAAACGCGAT ATTCTGTTCT TTATTGATAA TATCTTCCGT 800 

TTCACGCAAG CAGGTTCAGA AGTGTCCGCC CTTTTGGGAC GTATGCCCTC 850 

CGCTGTTGGT TACCAGCCCA CGTTGGCTAC GGAAATGGGT GCGATGCAGG 9 00 

AACGTATCAC GTCTACCCGC AAAGGTTC T A TCACCTCCG 93 9 



55 



2 ) INFORMATION FOR SEQ ID NO : 251 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 833 bases 
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10 



40 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251 



CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTTC 50 

AGAATGGTAA TGAGAGCCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

15 GGTATTGTGC GTACCATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 15 0 

TCTGGAAGTT AAAGACCTCG AGCACCCAAT CGAAGTACCA GTAGGTAAAG 2 00 

CAACCCTGGG CCGTATCATG AACGTCCTGG GTCAGCCGAT CGACATGAAA 250 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGTT CAGCACCTTC 3 00 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 50 

nvuAmnrsArrT GATGTGTCCG TTCGCGAAGG GCGGTAAAGT TGGTC TGTTC 400 

GGCGGTGCGG GTGTAGGTAA AACCGTAAAC ATGATGGAGC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATG AC C G A CTCCAACGTT 550 

CTGGACAAAG TATCCCTGGT TTACGGCCAG ATGAACGAGC CACCAGGAAA 600 

25 CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAGTTCCGTG 650 

ACG AAGGTC G TGACGTACTG CTGTTCGTTG ACAACATCTA CCGTTACACC 7 00 

CTGGCCGGTA CGGAAGTATC CGCACTGCTG GGTCGTATGC CATCAGCAGT 750 

AGGCTACCAG CCGACCCTGG CGGAAGAGAT GGGTGTTCTG CAGGAACGTA 800 

TCACCTCTAC CAAAACCGGT TCTATCACCT CCG 833 



30 



2) INFORMATION FOR SEQ ID NO: 2 52 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: ( , 

(A) ORGANISM: Stenotrophomonas maltophilia 
45 (B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 

GTGTACGACG CACTGAAGGT GGAAAACACC GAGATCACCC TCGAAGTCCA 50 

50 GCAGCAGCTG GGCGACGGCG TGGTGCGTAC CATCGCCCTC GGTTCCACCG 100 

ACGGCCTGAA GCGCAACCTG GTTGCCGTCA ACACCGGCCG TGGCATCTCG 150 

GTGCCGGTCG GCGCCGGCAC CCTGGGCCGC ATCATGGACG TGCTGGGCCG 200 

TCCGATCGAC GAAGCCGGCC CGGTCGCTGC CAGCGACAAC TGGGAAATCC 250 

ACCGCGCTGC GCCGTCGTAT GAAGACCAGT CCCCGGCCAC CGAGCTGCTG 3 00 

55 GAAACCGGCA TCAAGGTCAT CGACCTGATG TGCCCGTTCG CCAAGGGCGG 350 

C AAGGTC GGC CTGTTCGGCG GCGCCGGCGT CGGCAAGACC GTCAACATGA 400 

TGGAACTGAT CAACAACATC GCCAAGGCGC ACAGCGGCCT GTCCGTGTTC 450 

GCCGGCGTGG GTGAGCGTAC CCGTGAGGGC AACGACTTCT ACCACGAGAT 500 

GAAGGACTCC AACGTCCTGG ACAAGGTGGC GATGGTGTAC GGCCAGATGA 550 

60 ACGAGCCGCC GGGCAACCGT CTGCGCGTCG CCCTGACCGG CCTGACCATG 600 

354 
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GCCGAGTACT TCCGCGATGA GAAGGACGAA 
CCTGCTGTTC GTCGACAACA TCTACCGCTA 
TGTCGGCACT GCTGGGCCGC ATGCCGTCCG 
CTGGCCGAGG AAATGGGCGT CCTGCAGGAG 
5 TGGTTCGATC ACCTCGATC 



AACGGCAAGG GCAAGGACGT 650 
CACCCTGGCC GGTACCGAAG 7 00 

CGGTGGGTTA CCAGCCGACC 750 
CGCATCACCT CGACCAAGAA 800 

819 



2 ) INFORMATION FOR SEQ ID NO: 2 53 

10 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 49882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

25 AGTTTGAAGG CCCTTTGCCA AATATTC TC A ATGCATTAGA AACAGATAAT 50 

TTGGGCAATC GGC TAGTTTT AGAAGTTGCT CAGCATTTGG GTGAAAATAC 100 

CGTGCGTACC ATTGCCATGG AT AC T AC C G A TGGTCTTGTC CGTGGTCAAA 150 

AAGTTTTTGA TACAGGAACA CAGATCAGTG TTCCCGTGGG AGAAGCAACA 2 00 

CTTGGTCGTA TTATGAATGT GATTGGAGAG CCGGTTGATA ATGTTGGCCC 2 50 

3 0 AATTGCTACA AGCAAAACCC GTTCCATTCA CCAAGAGGCT CCTGAATATG 300 

TGGAGGAATC AACCGCATCA- -GAAATCCTTG TGACTGGTAT TAAAGTCGTT 350 

GATCTGTTAG CTCCTTATTC TAAAGGGGGG AAGGTTGGTT TGTTTGGAGG 400 

TGCCGGTGTT GGTAAAACCG TTCTCATTAT GGAGCTTATC AACAATATTG 450 

CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTTGG TGAACGTACA 500 

3 5 CGTGAGGGAA ATGATCTTTA TTATGAAATG ATCG AAAGC C GTGTGAATGT 550 

GAATCCAAAA GACAACAATG GTTCAACAGA AGGATCAAAA TGTGCACTCG 600 

TTTATGGGCA AATGAATGAA CCACCAGGGG CGCGTGCACG TGTGGCTCTT 65 0 

TCAGGATTGA CCATTGCAGA AAGTTTC CGT GATGAGGGAC AAGATGTTTT 7 00 

GTTCTTCGTA GATAATATTT TCCGTTTTAC GCAAGCAGGC GCTGAAGTGT 7 50 

40 CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TAGGGTATCA GCCAACTTTG 800 

GCAACTGATA TGGGGGCTTT GCAAGAGCGT ATTACCAGTA CAAGAACAGG 850 

CTCTATTACC TCTG 864 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 254 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 866 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescentis 

(B) STRAIN: ATCC 157 03 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 54 

355 
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TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 50 

gkccaaSaEc GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100 

SSSSSSX ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 150 

5 AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 200 

C^CGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGCCAC GTATTCGACG 250 

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 300 

Smotmot CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 350 

CAAGACCCAG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 400 

10 COTATCTCCA SGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450 

AAGACCGTGC TGATCCAGGA AATGATTCAG CGTGTGGCTC AGAAC CACGG 500 

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT GAGGGTAACG 550 

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 600 

GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 650 

15 GACCGCACTG ACCATGGCAG AGTACTTCCG TG AC GT AC AG AATC AGGAC G 7 00 

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 750 

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 800 

CCTGGC CGAT G AG ATGGGC G CGCTGCAGGA GCGAATCACT TCGACCCGTG 850 
GACACTCCAT CACCTC 



on 



2 ) INFORMATION FOR SEQ ID NO: 2 55 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 : _ 

- -(ID-MOLECULE- TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella abortus 
35 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

AAGGCCAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 50 

40 CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 100 

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 15 0 

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 200 

CGCATCATGA ACGTCATCGG CGAGCCGGTT G AC GAAGC AG GCCCCATCAA 250 

GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 300 

45 AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 350 

" CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG GCGGTGCAGG 400 

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 450 

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 500 

GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 550 

50 CGGCGGCGAA GGCTC CAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 600 

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 650 

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 7 00 

CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTC TTCTC GGCCGTATTC 7 50 

CTTCCGCTGT GGGTTATCAG CCGACGCTGG CAACCGACAT GGGCGCCATG 800 

55 CAGGAACGCA TCACCACGAC G AC C AAGGGT TCGATCACCT CG 842 



2 ) INFORMATION FOR SEQ ID NO: 256 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 256 

15 CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 50 

AAAAT AAC AG CGAGCAGCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATCGTGC GTACCATCGC TATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 2 00 

CGACCCTTGG CCGTATCATG AACGTGCTGG GTCAGCCGAT C G AC ATG AAA 250 

20 GGCGATATCG GCGAAGAAGA CCGTTGGGCT ATTCACCGCG CTGCACCTTC 300 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TAATCG AC C T TATCTGTCCG TTCGCTAAGG GCGGTAAAGT AGGTCTGTTC 400 

GGTGGTGCGG GCGTGGGTAA AACCGTAAAC ATGATGGAGC TTATCCGTAA 450 

CATCGCGATC GAGCACTCCG GCTACTCCGT GTTTGCGGGC GTGGGTGAGC 500 

2 5 GTACTCGTGA GGGTAAC G AC TTCTATCACG AAATG AC CG A CTCCAACGTT 550 

CTGGACAAAG TTGCCCTGGT TTACGGCCAG ATGAACGAGC CACCGGGTAA 600 

CCGTCTGCGC GTAGCGCTGA CCGGTCTGAC CATCGCGGAG AAATTCCGTG 650 

ACGAAGGTCG TGACGTTCTG CTGTTCGTTG ATAACATCTA CCGTTACACC 7 00 

CTGGCCGGTA CTGAAGTATC CGCGCTGCTG GGTCGTATGC CTTCTGCGGT 7 50 

3 0 AGGTTACCAG. CCAACTCTGG CGGAAGAGAT GGGTGTTCTT CAGGAGCGTA 800 

TTACCTCCAC CAAGACCGGT TCCATCACCT CCG 833 



3 5 2 ) INFORMATION FOR SEQ ID NO: 257 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) . STRAIN: ATCC 33432 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 



TTCCCTCAGG ACGGCGTACC GCGCGTATAT GACGCGCTTG AGGTACAGAA 50 

TAACAGCGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCAATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GAAGTGAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

55 TCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCAATTGAT ATG AAAGGC G 2 50 

ACATCGGCGA AGAAGATCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAT 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 3 50 

CGACCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 4 50 

60 GCGATCGAGC ACTC CGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500 

357 



CA 02307010 2000-05-19 



TCGTGAGGGT AACGACTTCT AC C AC GAG AT 
ACAAAGTTGC ACTGGTTTAC GGCCAGATGA 
CTGCGCGTAG CGCTGACCGG TCTGACCATC 
AGGCCGTGAC GTTCTGCTGT TCGTCGATAA 
5 C CGGT AC AG A AGTTTCTGCA CTGCTGGGTC 
TATCAGCCTA CTCTGGCAGA AGAGATGGGT 
CTCCACCAAG ACCGGTTCCA TCACTTCCG 



GACCGACTCC AACGTTC TGG 55 0 

ACGAGCCGCC AGGTAACCGT 600 

GCGGAGAAAT TCCGTGACGA 650 

CATCTATCGT TATACCCTGG 7 00 

GTATGCCATC TGCGGTAGGT 75 0 

GTTCTTCAGG AGCGTATTAC 800 

829 



2 ) INFORMATION FOR SEQ ID NO: 258 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 0 bases 
!5 (B) TYPE: Nucleic acid 

<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 



20 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Cedecea neterl 

(B) STRAIN: ATCC 33855 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 

TCCCTCAGGA CGGCGTACCG CGCGTTTATG ACGCGC TTG A GGTACAGAAC 5 0 

AACAATGAGA AGCTGGTGCT GGAAGTTCAG CAGCAGCTCG GCGGCGGTAT 10 0 

CGTGCGTACC ATCGCAATGG GTTCTTCCGA CGGTCTGCGT CGTGGTCTGG 15 0 

3 0 TAGTAACAGA CCTCGAGCAC CCGATCGAAG TCCCGGTAGG TAAAGCGACC 200 

CTTGGGGGTA— TCATGAACGT-GCTGGGTCAG-T CCGATCGACA TGAAAGGCGA 250 

CATCGGCGAA GAAGACCGTT GGGCGATTCA CCGCGCAGCA CCTTCCTACG 300 

AAGAGCTGTC CAGCTCTCAG GAATTGCTGG AAACCGGCAT CAAAGTTATC 350 

GACCTGATTT GTCCGTTCGC TAAGGGCGGT AAAGTAGGTC TGTTCGGTGG 400 

35 TGCGGGCGTA GGTAAAACCG TAAACATGAT GGAGCTGATC CGTAACATTG 450 

CGATCGAGCA CTCCGGTTAT TCCGTGTTTG CGGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACCGACTCCA AC GTTCTGG A 55 0 

TAAAGTAGCA CTGGTTTACG GCCAGATGAA CGAGCCACCA GGTAACCGTC 600 

TGCGCGTAGC GCTGACCGGT CTGACCATCG CGGAAAAATT CCGTGACGAA 650 

40 GGCCGCGACG TTTTGC TGTT CGTTGATAAC ATCTACCGTT ATACCCTGGC 7 00 

CGGTACCGAA GTATCTGCAC TGCTGGGTCG TATGCCTTCT GCGGTAGGTT 750 

ATCAGCCAAC TCTGGCAGAA GAGATGGGTG TTCTTCAGGA GCGTATTACC 800 

TCCACCAAGA CCGGTTCTAT CACCTCCGTA 830 



2 ) INFORMATION FOR SEQ ID NO: 259 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chryseobacterium meningosep txcum 

(B) STRAIN: CDC B7 681 

60 

358 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 59 

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 50 

CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

5 AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150 

GGACAATCTG TAATAGGACA TGGAAGACAA ATT ACT ATG C CTATTGGTGA 200 

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 2 50 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATCCACAG AGAAGCACCT 3 00 

AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 3 50 

10 AGTAATCGAC CTTATCGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 400 

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 

AACATTGCTA AAGGACACGG TGGTCTTTCT GTATTCGCAG GAGTAGGTGA 500 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 600 

15 GATCTTTCTA AAGTTGACAG TGAGTTGATG AAAGAGTCTA AAGCTGCTTT 650 

CGTTTTCGGA C AG ATG AAC G AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 7 00 

TTTCTGGTCT TAG TTTAGCT GAATACTACC GTGATGGTGG CGAAAGCGGA 7 50 

CAAGGTAGAG ACGTTC TTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 800 

GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 850 

2 0 GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900 

ACTTCAACTA AAAACGGATC TATTACATCT G 931 



25 2) INFORMATION FOR SEQ ID NO: 2 60 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 72 6 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS : Double 
(D) .TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrohacter amalonaticus 

(B) STRAIN: ATCC 25405 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 60 



GTATCGTACG TACCATCGCA ATGGGTTCTT C C G AC GGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 150 

GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 2 00 

45 TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 250 

TATCGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 3 00 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 3 50 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 400 

TAC TCGTGAG GGTAACGACT TCTACCACGA AATG AC CG AC TCCAACGTTC 450 

5 0 TGGATAAAGT ATCC C TGGT A TATGGC C AG A TGAACGAGCC GCCGGGAAAC 5 00 

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 550 

CGAAGGTCGT GACGTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 600 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 7 00 

55 CACTTCTACC AAAACCGGTT CTATCA 726 



2 ) INFORMATION FOR SEQ ID NO: 2 61 

60 

359 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: t ( 

10 (A) ORGANISM: Citrobacter braaku 

(B) STRAIN: ATCC 43162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

m CCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

?SS?gctg gaagttcagc AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150 

ScGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTC GT AT 200 

CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGG CGAT ATCGGTGAAG 2 50 

20 SgA^GTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 300 

^CTCTCAGG AACTGCTGGA AACCGGCATC AAAGTOATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

2 5 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

togSa?atgg CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTG AC GT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 7 00 

TATCCGCTCT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCGACC 750 

3 0 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

-GGGTTCTATC AC_^_ . - . ! 



35 2 ) INFORMATION FOR SEQ ID NO: 2 62 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 ( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Citrobacter koseri 

(B) STRAIN: ATCC 27156 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 



GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50 

TGGAAGTTCA GCAGCAGCTC GGTGGC GGT A TCGTACGTAC CATCGCCATG 100 

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG ACCTTGAGCA 150 

CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 2 00 

55 TACTGGGCGA AC C AGTAG AC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250 

TGGGC T ATCC ACCGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 300 

GGAACTGCTG GAAAC CGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 350 

CGAAGGGCGG TAAAGTGGGT CTGTTCGGTG GTGCGGGTGT AGGTAAAACC 400 

GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 450 

60 CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500 

360 
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ACCACGAAAT GACCGACTCC AACGTTATCG 
GGCCAGATGA ACGAGCCGCC GGGAAACCGT 
CCTGACCATG GCGGAGAAAT TCCGTGACGA 
TCGTCGACAA CATCTACCGT TACACCCTGG 
5 CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 
AGAGATGGGT GTTTTGCAGG AACGTATCAC 
TCACCTCCGT A 



ACAAAGTATC C C TGGTTT AC 550 

CTGCGCGTTG CGCTGACCGG 600 

AGGTCGTGAC GTTCTGCTGT 650 

CCGGTACGGA AGTATCCGCA 7 00 

TACCAGCCGA CCCTGGCGGA 7 50 

CTCCACCAAA ACCGGTTCTA 8 00 

811 



10 

2 ) INFORMATION FOR SEQ ID NO : 263 

( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 816 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter farmer i 

(B) STRAIN: ATCC 51112 

25 ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 263 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 100 

CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 15 0 

30 GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 2 00 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 25 0 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 300 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 35 0 

GTGTC CGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTC GGT GGTGCGGGTG 4 00 

3 5 TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAC CGATTC CAACGTTCTG GATAAAGTAT 55 0 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

40 CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGC CGTATGCCAT CAGCGGTAGG CTACCAGCCA 750 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTTCTACCAA 800 
AACCGGTTCT ATTACC 



45 



60 



2 ) INFORMATION FOR SEQ ID NO: 2 64 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitroJbacter freundxi 

(B) STRAIN: ATCC 8090 



361 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 



50 
100 



arrATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTTAT GAATGGTAAA 

gagagSctog TgSgGAAGT tcagcagcag ctcggcggcg gtatcgtacg 

5 Saccatcgcc atgggttctt ctgacggtct gcgtcgtggt CTGGAAGTTA 150 

AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC AACGCTGGGT 200 

CGTATCATGA ACGTTCTGGG TCACCCGATC GACATGAAAG GCGATATCGG 250 

5gaa^aagag CGTTGGGCTA TCCACCGTGC AGCACCTTCC TACGAAGAGC 300 

TGTCAAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG 350 

m JtcSgtccgt tcgctaaggg CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG 400 

10 t^aSSaS IccgSaaaca tgatggagct gatccgtaac atcgcgatcg 450 

AACACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC TGGACAAAGT 550 

a?cc1^Scta tatSgccaga tgaacgagcc gcctggaaac cgtctgcgtg 600 

IS TTGCGCTGAC CGGTCTGACC ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT 650 

GA?G?TCrSc TCTTCGTTGA TAACATCTAT CGTTACACCC TGGCCGGTAC 700 

AGAAGTATCT GCACTGCTGG GTCGTATGCC ATCAGCGGTA GGCTACCAGC 750 

CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT CACCTCCACC 800 
AAAAC CGGTT CTATCACCT 

20 



2 ) INFORMATION FOR SEQ ID NO: 265 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
<D) TOPOLOGY: Linear 

30 . 

— (ii) MOLECULE LTYPE: Geno mic DNA -_- — - - 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter koseri 
35 (B) STRAIN: ATCC 27028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

40 ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 250 

AAGAAGAGCG TTGGGC T ATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 300 

45 TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TC CGTAAC AT CGCGATCGAA 450 

CACTCCGGTT ACTC CGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

50 CCTTGGTTTA CGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 7 50 

ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 800 

55 AACCGGTTCT ATCACCTCCG TA 0tiz 



2 ) INFORMATION FOR SEQ ID NO: 266 

60 

362 



CA 02307010 2000-05-19 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

{ vi ) ORIGINAL SOURCE: 
10 (A) ORGANISM: Citrobacter sedlakxi 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 

15 GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTACAGA ATGGTAATGA 50 

GCGTCTGGTG C TGG AAGTTC AGCAGCAGCT CGGTGGCGGT ATTGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTTGAGC ACCCGATCGA AGTC CCGGTA GGTAAAGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GACATCGGTG 250 

2 0 AAGAAGAGCG TTGGGCTATC CACCGTGCCG CGCCGTCCTA TGAAGAGTTG 3 00 

TCTAACTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTC CGTTC GCGAAGGGCG G TAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

25 TAACGACTTC TAC C AC G AAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

CCCTGGTGTA CGGCCAGATG AACGAGCCGC CTGGAAACCG TCTGCGCGTC 600 

GCACTGACCG GTC TG AC CAT GGCTGAGAAG TTCCGTGACG AAGGTC GTG A 650 

CGTTC TGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGC TGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 7 50 

30 ACTCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCAACCAA 800 

AACCGGTTCT ATCACCTCCG '82 0 



35 2 ) INFORMATION FOR SEQ ID NO : 267 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrohacter werkmanll 

(B) STRAIN: ATCC 51114 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 

50 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 2 00 

55 CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 3 00 

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 35 0 

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 40 0 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 45 0 

60 TCCGGTTACT CAGTGTTTGC GGGC GTTGGT GAACGTACTC GTGAGGGTAA 500 

363 
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CGACTTCTAC CACGAAATGA 
TGGTATATGG CCAGATGAAC 
CTG AC CGGTC TGACCATGGC 
TCTGCTGTTC GTTGATAACA 
5 TATCTGCACT GCTGGGTCGT 
CTGGCGGAAG AGATGGGTGT 
CGGTTC 



CCGACTCCAA CGTTCTGGAC 
GAGCCGCCGG GAAACCGTCT 
TGAGAAGTTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCATCAG CGGTAGGCTA 
TCTGCAGGAA CGTATCACCT 



AAAGT ATC C C 550 

GCGCGTTGCG 600 

GTCGTGACGT 65 0 

GGTACTGAAG 7 00 

CCAGCCAACC 750 

CTACCAAAAC 800 
806 



10 

2 ) INFORMATION FOR SEQ ID NO : 2 68 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 810 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 

( vi ) ORIGINAL SOUkCjg : 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 29935 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

OTGOTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC gtacgtacca 100 

TCGCCATGGG ^TCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

30 ???Sagca22 CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 200 

catcSacgtt CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGAGAAG 250 

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTATGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 350 

tccgttcgEt AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

35 GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 500 

?GACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAAGT ATC CC 550 

TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

4 0 ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 7 00 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGACC 750 

SSgSgGAAG AGATGGGTGT TCTGCAGGAA CGTATCACTT CTACCAAAAC 800 



45 



CGGTTCTATC 



2 ) INFORMATION FOR SEQ ID NO : 2 69 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 



60 



364 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 69 

TTGAGAACGG AGATTTGCCG CAGCTATTGA CCGCTATTGA AATTCCTCTG 50 

ISSaSJgtg AATCTCTGAT TGTCGAAGTT GCTCAGCATA TCGGTGATGA 100 

c ^?S5?CGC TGTATCGCTA TGGGCGGTAC AGATGGTCTG GTTCGTGGAA 150 

JSS^SS ?g!tacagga TCCGCAATCC GTCTACCGGT GGGAAAAGAA 200 

a??S3ggaa GAATGTTCAA TGTCCTCGGA CGTGAAATTG ATGGTCTGGG 250 

aSotagga ACGGATAACA CACTGCCGAT CCACAGACAG GCACCGGGCT 3 00 

TTGAGGAGCA GCAGACATCC GCAGAAATGC TGGAAACAGG AATTAAGGTC 350 

10 ATTGACCTGT TATGTCCATA TTCCAAGGGT GGTAAGATTG GTTTGTTTGG 400 

?^?2cGGGA GTAGGTAAAA CCGTACTGAT TCAGGAGCTG ATTCATAATA 450 

TCGCCAAGGA ACATGGTGGA ATGTCCGTCG TTACCGGTGT AGGGGAGAGA 500 

AC C C GTG AAG G AAAC G AC AT GTATCATGAA ATGAAGGACA GCGGTGTCCT 550 

TGATAAGACC GTACTGGTTT ACGGACAGAT GAATGAATCA CCGGGTGCCA 600 

15 GAATGCGTGT CGGTCTGACC GGGCTGACGA TGGC GGAAT A TTTCCGTGAT 650 

CACGACCATC AGGATGTATT GCTGTTTATT GATAATATTT TCCGTTTTAC 7 00 

CCAGGCGGGA AGTGAAGTAA GTGCCCTGCT GGGACGTATG CCAAGTGCAG 750 

TAGGCTATCA GCCGACACTT GCGACAGAAA TGGGACAGCT GCAGGAGCGC 800 

ATTACATCCA CGAAGGATGG TTCCATT 827 

20 • 



2) INFORMATION FOR SEQ ID NO: 27 0 

2 5 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN : ATCC 13124 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 



GCAAATGATG 

40 CGATGGAAAA 
TAGTTAGAAC 
GAAGCTGTTG 
ATTAGGAAGA 
AAGTTAATAA 

45 GAAGAACAAT 
CGACCTTTTA 
GTGCAGGTGT 
GCTAAAGAGC 
AAGAGAAGGT 

5 0 AAAATACAGC 
ATGAGAGTTG 
AGGTCAAGAC 
CTGGATCAGA 
TACCAACCAA 

5 5 ATC AACTACC 



AACTTCCTAA 
ATCTTAGTTT 
TATAGCTATG 
ATACAGGAGC 
ATATTTAACG 
CGAAGAAAAA 
CAGTTGTTCC 
GCACCTTACC 
TGGTAAAACA 
ACGGTGGACT 
AATGACC TTT 
ATTAGTATTT 
CTTTAACAGG 
GTGTTATTAT 
GGTTTCAGCT 
CTCTTGCTAC 
CATGGATCAA 



TATATTTAAT 
GTGAGGTAGA 
GAAGCTACTG 
ACCTATATCA 
TATTAGGTAA 
TATCCAATTC 
TCAAATGTTT 
AAAGAGGGGG 
GTTC TTATCC 
TTCTGTATTC 
ACTATGAAAT 
GGACAAATGA 
AC TT ACTATG 
TCATAGATAA 
TTATTAGGAA 
AGAGATGGGA 
TTACATCAG 



GCAATACACA 
GCAACACGTA 
AAGGACTAAG 
GTACCAGTTG 
ACCACTAGAT 
ATAGACCAGC 
GAGACAGGAA 
AAAAATCGGT 
AAGAGCTTAT 
ACAGGAGTTG 
GATGGAATCA 
ACGAACCACC 
GCTGAGTACT 
CATATTCAGA 
G AAT AC CATC 
GCTCTTCAAG 



TAAAAATGGA 50 

GGAGACGATA 100 

AAGAGGTGTA 150 

GTGAATGCGT 2 00 

AGTGGAGCTG 2 50 

TCCATCATTT 3 00 

TAAAGGTTAT 3 50 

CTATTTGGAG 400 

AAACAACATA 450 

GAGAAAGATC 5 00 

GGAGTTATAA 550 

TGGAGCAAGA 600 

TCAGAGACCA 650 
TTCTCACAAG ' 7 00 

AGCTGTTGGT 7 50 

AGAGAATCAC 800 
829 



60 



2 ) INFORMATION FOR SEQ ID NO : 271 

365 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271 
TTGTCCACGT TGGATRTCTT CA 



15 2 ) INFORMATION FOR SEQ ID NO : 272 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 (vi) ORIGINAL SOURCE: ( 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 



CCCGCGTGGC GAGCTGCCGG CACTGTACAA CGCGTTGACT GTCGAGGTCA '50 

CCCTCGAGGC AGTCGCTAAG ACCATTACCC TTGAGGTTGC CCAGCACTTG 100 

GGCGACAACC TCGTTCGCGC CGTGTCCATG GCCCCTACCG ACGGCCTCGT 150 

CCGTGGTGCT GTTGTGACCG ACTCGGGCAA GCCAATCTCC GTGCCAGTTG 2 00 

3 5 GCG AC GTTGT TAAAGGCCAC GTTTTCAACG CACTGGGCGA TTGC TTGG AT 2 50 

gSScaggtc TCGGCCGCGA TGGTGAGCAG TGGGGAATTC ACCGCGATCC 300 

ACCACCATTC GATCAGCTCG AAGGTAAGAC CGAAATCCTC GAGACCGGTA 3 50 

TTAAGGTCAT CGACTTGCTC ACCCCTTACG TTAAGGGCGG CAAGATTGGT 400 

CTGTTCGGTG GTGCAGGTGT GGGTAAGACC GTGCTCATCC AGGAGATGAT 450 

40 CACTCGTATT GCTCGCGAGT TCTCCGGTAC CTCCGTCTTC GCTGGCGTTG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTCGAAAT GGAAGAAATG 550 

GGCGTTCTTC AGGACACCGC TCTCGTGTTC GGCCAGATGG ACGAGCCACC 600 

AGGAGTCCGT ATGCGCGTTG CTCTGTCCGG TCTGACCATG GCGGAGTACT 650 

TCCGCGATGT TCAGCACCAG GACGTGCTTC TGTTCATCGA TAACATTTTC 700 

45 CGTTTCACCC AGGCCGGTTC CGAGGTTTCG ACCCTTCTTG GTCGTATGCC 750 

TTCCGCCGTG GGTTACCAGC CAACCTTGGC TGACGAGATG GGTGTTCTCC 800 
AGGAGCGTAT TACCTCTA 



2 ) INFORMATION FOR SEQ ID NO: 273 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 

366 
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(vi) ORIGINAL SOURCE: , 

(A) ORGANISM: Corynebacterium pseudodip/athera tacum 

(B) STRAIN: ATCC 10700 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

CGATGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 5 0 

GTCGCCAAGA CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100 

GATCCGGACC ATTGCGTTGG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150 

10 AGGTTATCGA CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 200 

AAAGGACACG TCTTCAATGC GCTCGGTGAG TGTTTGG AC G AACCAGGATT 2 50 

GGGCCGCGAC GGCGAACAGT GGGGAATCCA CCGCGATCCG CCACCATTCG 300 

ATGCGCTGGA GGGCAAAACC GAGATTCTGG AG AC TGG AAT CAAGGTTATC 350 

GACCTCCTTA CCCCTTACGT TAAGGGTGGC AAAATTGGTC TGTTC GGTGG 400 

15 CGCCGGCGTC GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450 

CTCGTAACTT CTCCGGTACT TCCGTGTTCG CCGGCGTCGG TGAGCGTACC 500 

CGTGAGGGTA CTGACCTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 550 

AGACACCGCC CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTC GT A 600 

TGCGCGTGGC CTTGTCTGGT CTAACCATGG CTGAATATTT CCGCGACGTT 650 

2 0 CAAAACCAGG AC GTTTTGTT GTTCATTGAC AACATCTTCC GTTTTACTCA 700 

GGCAGGTTCC GAGGTTTCCA CGCTGTTGGG CCGTATGCCT TCCGCCGTGG 7 50 

GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 800 

ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 833 



25 



40 



2) INFORMATION FOR SEQ ID NO: 274 



( i ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 417 bases 

(B) TYPE: Nucleic- acid 

(C) " : ~STRANDEDNESST "Double 

( D ) TOPOLOGY : Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coryn eba c t erl um ulcerans 

(B) STRAIN: NCTC 8665 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 274 

CCGTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 50 

ACCTCTGTGT TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100 

45 CTGGGTCGAG ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150 

TCGGCCAGAT GGACGAGCCG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 200 

GCGCTGACCA TGGCGGAGTA CTTC CGCG AT GTGCAGAACC AGGACGTGCT 2 50 

GCTGTTCATC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 3 00 

CCACCCTGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 350 

50 GCGGACGAGA TGGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 400 

CTCCATCACC TCGATGC 417 



55 2 ) INFORMATION FOR SEQ ID NO : 275 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 

367 
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15 
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25 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: t 

(A) ORGANISM: Corynebactenum urealyticum 

(B) STRAIN: ATCC 43042 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 

GGGCAGCAGC CAGCACTATT CAACGCGCTG CACGTCGAGG 
GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC 
ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT 
GCAGAGGTCA AGGACACCGG TAAGCCGATC TCTGTGCCAG 
TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG 
GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA 
TTCGACGAGC TCGAGGGTAA GACCGAGATC CTGGAGACCG 
CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT 
GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT 
ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG 
TACCCGTGAG GGTACGGACC TC TTCGTGG A C-ATGGAGGAG 
TCCAGGACAC CGCGCTGGTG TTCGGTCAGA TGGATGAGCC 
CGTATGCGCG TGGCTCTGTC CGGTCTGACC ATGGCGGAGT 
TGTTCAGGGC CAGGACGTGC TGCTGTTCAT CGACAACATC 
CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT 
GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC 
CATTACCTCC ACGAAGGGTA AGTCCATTAC CTCCC 



TTGACCTCGA 50 

C TGGGTG AC A 100 

GGTCCGCGGT 150 

TCGGCGATGT 200 

GATGAGCCAG 250 

GCCACCGGCA 300 

GCGTTAAGGT 350 

GGCCTCTTCG 400 

GATTACCCGT 450 

TCGGCGAGCG 500 

ATGGGCGTGC 550 

GCCGGGAGTC 600 

ACTTCCGCGA 65 0 

TTC CGTTTC A 700 

GCCGTCCGCA 7 50 

TGCAGGAGCG 800 
835 



30 

2 ) INFORMATION FOR SEQ ID NO : 276 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 818 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 

CCCGTCACGC TGTCCCGAAA GTGTACGATG CTTTGCAGGT GGATGAAAAT 5 0 

AATTTAACGC TCGAAGTCCA ACAGCAACTC GGGGACGGTG TCGTGCGCAC 100 

AATTGC C ATG GGCAGCACTG AGGGC TT AAA ACGCGATATC GCCGTAAAAA 150 

50 ATACGGAAAA ACCGATTGAA GTTCCCGTAG GAAAAGAAAC TTTAGGTCGT 200 

ATCATGAACG TGCTGGGTGA GCCGATCGAT GAGTTAGGTC CCATTAATTC 250 

. AAAAGAAAAA CTCCCTATTC ATCGTCCTGC GCCGAGCTTT ATTGAGCAAT 300 

CTGGCGCTAC CGAATTATTA GAAACCGGTA TTAAAGTGGT CGATTTGCTT 350 

TGCCCCTTTG CTAAGGGAGG CAAAGTGGGT C TTTTTGG AG GCGCGGGCGT 400 

55 TGGAAAAACG GTTAATATGA TGGAATTAAT CCGTAATATC GCCATTGAAC 450 

ACAGCGGTTA TTCTGTTTTT GCGGGTGTGG GAGAAAGAAC GCGAGAAGGC 500 

AATGATTTTT ATCATGAAAT GAAAGAATCC AATGTCTTGG ATAAAGTGGC 550 

GTTGGTGTAC GGACAAATGA ACGAGCCGCC AGGGAACCGC TTGC GGGTGG 600 

GTTTGACGGG GCTTACGCTG GCGGAAGCCT TCCGTGACGA AGGACGCGAC 650 

60 GTTCTGTTAT TTATCGATAA TATCTTTCGT TAC AC TTTGG CAGGGGTTGA 700 
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AGTCTCTGCC CTCCTCGGTC GGATGCCATC GGCTGTGGGT TATCAGCCGA 
CGTTGGCCGA AGAGATGGGG GCCCTGCAAG AAC GC ATT AC TTC C AC T AAA 
AAAGGGTCCA TTACGTCG 



750 
800 
818 
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2 ) INFORMATION FOR SEQ ID NO: 277 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshlnae 

(B) STRAIN: ATCC 33379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 



TTCCCGCAGG 
CGGTGCCACG 
TAGTTCGCTG 
GAGGTTGAAG 
TCTGGGCCGT 
AGATCGGTGA 
GAAGATCTGT 
CGACCTGATT 
GGGC CGGTGT 
GCTATCGAGC 
CCGTGAGGGT 
ATAAAGTTTC 
CTGCGCGTGG 
AGGTC GTGAT 
CCGGTACTGA 
TATCAGCCGA 
CTCCACTAAG 



ACGCCGTGCC 
AAACTGGTAC 
CATCGCGATG 
ACCAAGACCA 
ATCATGAACG 
AGAAGAGCGT 
CTAACTCTCA 
TGCCCGTTCG 
GGGTAAGACC 
ACTCCGGTTA 
AACGACTTCT 
TC TGGTGT AT 
CGCTGACCGG 
GTACTGTTGT 
AGTCTCCGCT 
CTCTGGCGGA 
ACCGGGTCCA 



GAAGGTGTAT 
TGGAAGTGCA 
GGCTCTTCCG 
TCCGATCGAG 
T AC TGGGTG A 
TGGGCTATCC 
GGAACTGCTG 
CTAAAGGCGG 
GTTAACATGA 
CTCAGTCTTC 
ACCACGAGAT 
GGTCAGATGA 
TCTGACCATG 
TCATCGATAA 
CTGCTGGGCC 
GGAAATGGGG 
TCACCTCTG 



AACGCACTGG 
GCAGCAGCTG 
ACGGTCTGCG 
GTTCCTGTTG 
TCCGGTCGAC 
ATCGTGCTGC 
GAGACCGGCA 
TAAAGTGGGC 
TGGAGC TTAT 
GCCGGTGTGG 
GACCGATTCC 
ACGAGCCACC 
GCGGAGAAAT 
CATCTACCGT 
GTATGCCGTC 
GTGCTGCAAG 



AAGTAAAAGG 50 

GGTGGCGGCG 100 

CCGTGGGCTA 150 

GC AAGGC G AC 200 

ATGAAGGGCG 2 50 

ACCGAGCTAT 3 00 

TTAAGGTTAT 3 50 

CTGTTCGGTG 400 

CCGTAACATC 450 

GTGAGCGTAC 500 

AACGTATTGG 550 

GGGAAACCGT 600 

TC CGTGATGA 650 

TATAC CTTGG 700 

GGCGGTAGGT 7 50 

AGCGTATTAC 800 
829 
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2 ) INFORMATION FOR SEQ ID NO : 27 8 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella tarda 

(B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 
ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 

369 



50 
100 
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TCGCGATGGG CTCCTCCGAC GGTCTGCGCC 
CAAGACCATC CGATCGAGGT TCCGGTC GGT 
CATGAACGTA CTGGGTGATC CGGTCGACAT 
AAGAGCGTTG GGCTATCCAC CGCGCGGCGC 
5 AACTCTCAGG AGCTGCTGGA GACCGGCATC 
CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT 
GTAAGACCGT TAACATGATG GAGCTTATCC 
TCCGGTTACT CCGTATTTGC CGGTGTAGGC 
CGAC TTCTAC CACGAGATGA CCGACTCCAA 

10 TGGTATACGG CCAGATGAAT GAGCCGCCGG 
TTGACCGGTC TGACCATGGC GGAGAAATTC 
GTTGTTGTTC ATCGATAACA TTTATCGTTA 
TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG 
CTGGCGGAAG AGATGGGTGT GTTGCAAGAG 

15 GGGCTCTAT 



GTGGGCTGGT GGTTGAAGAC 150 

AAGGCGACCC TGGGC C GTAT 200 

GAAGGGCGAG ATCGGCGAAG 250 

CGAGCTATGA AG ATCTGTC C 3 00 

AAGGTTATCG ACCTGATTTG 3 50 

GTTCGGTGGT GCCGGTGTGG 4 00 

GTAACATCGC TATCGAGCAC 45 0 

GAGCGTACCC GTGAGGGTAA 500 

CGTATTGGAT AAAGTTTCTC 550 

GAAACCGTCT GCGTGTGGCG 600 

CGTGATGAAG GTCGCGATGT 650 

TACCTTGGCT GGTACCGAAG 7 00 

CGGTAGGTTA TCAGCCGACC 7 50 

CGTATCACCT CAACGAAGAC 800 

809 



20 



2 ) INFORMATION FOR SEQ ID NO: 27 9 



( i ^ SEQUENCE CHAEACTERX3 TXCS : 
{h) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Elkenella corrodens 
(„B.) 1 STRAIN.: ATCC 23834 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 

35 TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 50 

TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 100 

GTACCATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 15 0 

CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 200 

CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTCCGATCG 250 

40 GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 3 00 

CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 350 

GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 400 

GTGTGGGCAA AACCGTGAAC ATGATGGAGC TGATTAACAA C ATTGC C AAA 450 

GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 500 

45 AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 550 

^ TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 600 

GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 65 0 

CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTACC 7 00 

GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 7 50 

50 TCTGCTGTGG GTTATCAGCC AAC TTTGGCT GAAGAAATGG GTCGTTTGCA 800 

GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 840 



55 2 ) INFORMATION FOR SEQ ID NO : 2 80 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 

370 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Enterobacter agglomerans 

(B) STRAIN: ATCC 27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 



GCCGTACCAC GAGTGTACGA TGCACTTGAG GTAAAGAATG GTGAAGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AACAGCTCGG CGGTGGCGTT GTACGTACCA 100 

TCGCAATGGG TTCTTCTGAT GGTCTGCGTC GTGGTC TGG A AGTAACGGAC 150 

CTGGCTCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

15 CATGAACGTA CTGGGCGAAC CAGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCGATCCAC CGTGCAGCAC CGTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

2 0 TCCGGTTACT CTGTGTTTGC GGGC GTAGGT GAACGTACTC GTGAGGGTAA 5 00 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTGTATGG TCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGTTGTTC GTTGACAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

25 TATCCGCACT GCTGGGCCGT ATGCCTTCTG CGGTAGGTTA TCAGCCGACG 7 50 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 
CGG 



2 ) INFORMATION FORlSEQ. ID NO: 281 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 833 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



803 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter awnigenus 

(B) STRAIN: ATCC 33072 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTAC 50 

AGAATGGTAA CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 100 

GGTATCGTAC GT AC TATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

50 TCTGGCTGTT AAAGATCTCG AACACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CAACACTGGG TCGTATCATG AACGTTTTGG GTCAACCAAT C G AC ATG AAA 250 

GGCGACATCG GTGAAGAAGA C CGTTGGGC A ATCCACCGTG CAGCACCTTC 3 00 

CTATGAAGAG CTGTCTAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 50 

TTATCGACCT GATGTGTCCG TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 400 

55 GGCGGTGCGG GCGTGGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 500 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 550 

CTGGATAAAG TATCCCTGGT TTATGGCCAG ATGAACGAGC C AC C AGG AAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGTCTGAC TATGGC TG AG AAGTTCCGTG 650 

60 ACGAAGGTCG TGACGTACTG CTGTTCGTAG ATAACATCTA CCGTTACACC 7 00 

371 
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CTGGCCGGTA CTGAAGTATC TGCGCTGCTG GGCCGTATGC CTTCAGCGGT 750 
agg?Saccag CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 800 
TCACTTCTAC CAAAACCGGT TCTATC AC CT CCG 



2 ) INFORMATION FOR SEQ ID NO : 2 82 

( i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter asburxae 

(B) STRAIN: ATCC 35953 



20 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 



GCCGTACCAC GCGTGTACGA CGCGC TTG AG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

2 5 TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTCAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 300 

AGCTCTCAGG AACTGCTGGA AAC CGGTATC AAAGTTATC G AC CTG ATGTG 350 

3 0 TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTC GGTGGT GCGGGTGTTG 400 

-GTAA^COT~~AA^^ GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

35 CTGACCGGTC TGACGATGGC TGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 
CGGTTCTATC 

40 

2 ) INFORMATION FOR SEQ ID NO: 283 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cancerogenus 
55 (B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 50 
60 CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100 

372 
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TCGCCATGGG TTCTTCCGAC GGTCTGCGTC 
CTTGAGCACC CGATCGAAGT CCCGGTAGGT 
CATGAACGTA TTGGGTCAAC CAATCGACAT 
AAGAGCGTTG GGCTATC CAC CGCGCAGCAC 
5 AGCTCTCAGG AACTGCTGGA AACCGGCATC 
TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT 
GTAAAACCGT AAACATGATG GAGCTGATCC 
TCCGGTTATT CCGTGTTTGC GGGCGTGGGT 
CGACTTCTAC CACGAAATGA CCGACTCCAA 

10 TGGTGTACGG CCAGATGAAC GAGCCACCAG 
CTGACCGGCC TGACCATGGC TGAGAAGTTC 
ACTGCTGTTC GTTGATAACA TCTACCGTTA 
TATCTGCACT GCTGGGTCGT ATGCCTTCAG 
CTGGCGGAAG AGATGGGTGT TCTTCAGGAA 

15 CGGTTCTATC A 



GTGGTCTGGA AGTAAAAGAC 15 0 

AAAGCAACAC TGGGTCGTAT 200 

GAAAGGCGAC ATCGGTGAAG 2 50 

CTTCCTACGA AGAGCTGTCC 300 

AAAGTTATCG ACCTGATGTG 3 50 

GTTCGGTGGT GCGGGTGTAG 400 

GTAACATCGC GATCGAGCAC 450 

GAACGTACTC GTGAGGGTAA 500 

CGTTCTGGAT AAAGTATCCC 55 0 

GAAACCGTCT GCGCGTTGCG 600 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACCGAAG 7 00 

CGGTAGGTTA TCAGCCTACG 750 

CGTATCACCT CAACCAAAAC 800 

811 



2 ) INFORMATION FOR SEQ ID NO: 2 84 



20 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: En terobac ter cloacae 

__(B) ^STRAIN: ATCC 13 047; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

3 5 GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 5 0 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 20 0 

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

4 0 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

45 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 55 0 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 7 00 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

5 0 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC ACTTCCG 817 



55 2 ) INFORMATION FOR SEQ ID NO : 2 85 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 

373 
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10 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 



AATGAGAGCC TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATCGT 5 0 

GCGTACCATC GCAATGGGTT CTTCTGACGG TCTGCGTCGC GGTCTGGAAG 100 

TTAAAGATCT CGAACATCCG ATCGAAGTCC CGGTAGGTAA AGCGACCCTC 15 0 

GGCCGTATTA TGAACGTGCT GGGTCAGCCG GTTGATATGA AAGGCGATAT 2 00 

15 CGGCGAAGAA GAGCGTTGGG CGATCCACCG CGCTGCGCCG TCCTATGAAG 250 

" AGCTCTCCAG CTCTCAGGAA CTGCTGGAAA CCGGTATCAA GGTAATGGAC 3 00 

CTGATTTGCC CGTTCGCGAA GGGCGGTAAA GTCGGTCTGT TCGGCGGTGC 35 0 

GGGCGTTGGT AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA 400 

TCGAGCACTC CGGCTACTCC GTGTTTGCGG GC GTGGGTG A ACGTAC TCGT 450 

20 GAGGGTAACG ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGACAA 500 

AGTATCCCTG GTGTACGGCC AGATGAACSA GCCSCCGGSA AACCGTCTGC 550 

GCGTGGCGCT GACCGGTCTG ACCATGGCTG AGAAATTCCG TGACGAAGGT 600 

CGTGACGTTC TGC TGTTCGT CGATAACATC TACCGCTATA CCCTCGCCGG 650 

TACTGAAGTA TCCGCACTGC TGGGC CGT AT GCCTTCTGCA GTAGGTTACC 700 

25 AGCCGACGCT GGCGGAAGAG ATGGGTGTTC TGCAGGAACG TATCACCTCC 750 

ACCAAAACCG GTTCTA 766 



30 2 ) INFORMATION FOR. SEQ ID NO : 286 



35 



40 



45 



50 



55 



60 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG 
CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC 
TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA 
CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC 
CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC 
AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA 
AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG 
TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT 
GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC 
TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC 
CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC 
TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT 
CTGACTGGCC TGACGATGGC TGAGAAGTTC CGTGACGAAG 
TCTGCTGTTC GTCGATAACA TCTACCGTTA CACCCTGGCC 
TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA 

374 



GTAACGAGAG 50 

GTGCGTACCA 100 

AGTGAAAGAC 150 

TGGGTCGTAT 200 

ATCGGTGAAG 250 

AGAGCTGTCC 300 

ACCTGATGTG 3 50 

GCGGGTGTAG 400 

GATCGAGCAC 450 

GTGAGGGTAA 5 00 

AAAGTATCCC 550 

GCGCGTTGCG 600 

GCCGTGACGT 650 

GGTACGGAAG 7 00 

TCAGCCAACG 7 50 



\ 



CA 02307010 2000-05-19 



CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CGACCAAAAC 800 
CGGTT 



2 ) INFORMATION FOR SEQ ID NO : 2 87 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 791 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobac ter sakasaku 

(B) STRAIN: ATCC 29544 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 

TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 5 0 

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100 

CCGACGGTCT GCGTCGCGGT CTGCCTGTTG C AG AC C TTG A GCACCCGATC 150 

2 5 GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 200 

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 2 50 

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCC AGCTC TCAGGAACTG 3 00 

CTGGAAACCG GCATCAAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 3 50 

CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 400 

30 TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 450 

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 55 0 

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 600 

ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650 

3 5 CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 7 00 

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 7 50 

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAAC CGGTT C 791 

40 

2) INFORMATION FOR SEQ ID NO: 288 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 839 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS-: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 

TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 15 0 

60 GAATGGAAGT TGTCGATACT GGCAAATCAA TCTCCGTTCC TGTAGGTAAA 200 

375 
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GATACTCTAG GTCGTGTGTT T AAC GT ATT A 
AGC AC CTTTT CCAGAAGATG CAGAAAGAAG 
CTGCTTTTGA AGACCTTAGT ACAAGTAACG 
AAGGTTATCG AC TT ATT AG C CCCTTATTTA 
5 ATTCGGTGGT GCCGGTGTTG GTAAAACCGT 
AT AATATC GC CCAAGAACAC GGTGGTATTT 
GAACGTACTC GTGAAGGGAA CGACCTTTAT 
CGTTATTGAG AAAACAGCCA TGGTGTTCGG 
GTGCACGTAT GCGTGTTGCC TTGACTGGTT 
10 CGTGATGAAG AAGGACAAGA TGTGTTGCTA 
CTTTACTCAA GCCGGATCAG AAGTTTCTGC 
CAGCCGTTGG GTATCAACCA ACTTTGGCAA 
GAACGAATCA CTTCAACCAA AAAAGGTTCG 



GGTGAAACGA TTGATAAAGA 2 50 

CGGCATTCAT AAAAAGGCGC 3 00 

AGATTTTGGA AACAGGGATC 3 50 

AAAGGTGGGA AAGTCGGACT 400 

TTTGATCCAA GAATTAATTC 450 

CAGTGTTTAC CGGTGTTGGG 500 

TATGAAATGA AAGACTCTGG 550 

ACAAATGAAC GAGCCGCCTG 600 

TGACATTAGC TGAATATTTC 650 

TTTATTGACA ACATCTTCCG 7 00 

CTTATTAGGA CGTATGCCAT 75 0 

CTGAAATGGG GCAATTACAA 800 

ATCACTTCA 839 



15 

2 ) INFORMATION FOR SEQ ID NO : 289 

( i ) SEQUENCE CHARACTERI STICS : 
on (A) LENGTH: 847 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

25 ( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casselirlavus 

(B) STRAIN: ATCC 25788 



30 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 289 



TGATTGTTTA 


50 


GCCTTAGAAC 


100 


AGATGGCTTA 


150 


CTGTTCCAGT 


200 


GACACCATCG 


250 


GATTCACAAA 


300 


TCCTTGAAAC 


350 


GGGGGGAAAG 


400 


GATCCAAGAA 


450 


TCTTCACAGG 


500 


GAAATGAAAG 


550 


AATGAACGAA 


600 


CATTAGCCGA 


650 


ATCGATAATA 


700 


ACTAGGTCGG 


750 


AGATGGGGCA 


800 


ACGTCTA 


847 



2) INFORMATION FOR SEQ ID NO: 2 90 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



376 
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( i i ) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terococcus durans 

(B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 



TTTCTTTAGA 

10 AAAAATGATG 
AGGTGACGGT 
AACGCGGAAT 
GGTAAAGAAA 
TTTATCTGCA 

15 AAGCACCAAA 
GGGATCAAAG 
TGGATTATTC 
TGATCCATAA 
GTTGGTGAAC 

20 TTCAGGAGTT 
CACCAGGTGC 
TACTTCCGTG 
TTTCCGTTTC 
TGCCTTCTGC 

25 TTACAAGAAC 



CCAATCCTTA 
AGAAGAAATC 
GTCATCCGTA 
GGAAGTCATT 
CGTTAGGTCG 
CCTTTCCCAG 
CTTTGATGAG 
TTATTGACTT 
GGTGGTGCCG 
TATCGCTCAA 
GTACACGTGA 
ATTGAAAAAA 
AC GT ATGCGT 
ATGTTGAAGG 
ACTCAAGCCG 
CGTTGGGTAC 
GGATCAC TTC 



CCAGATATCA 
GAAAGTTGTT 
CAATCGCGAT 
GATACAGAAA 
TGTATTCAAT 
AAGATGCAAA 
TTAAGT AC AA 
GCTTGCTCCT 
GTGTAGGTAA 
GAACACGGTG 
AGGTAATGAC 
CAGCCATGGT 
GTTGCCTTGA 
GCAAGACGTG 
GTTCAGAAGT 
CAACCAACGC 
AACGAAAAAA 



ACAACGCTTT 
CTTGAAACAG 
GGAATCAACA 
AAGCAATTTC 
GTATTAGGAG 
ACGTAGCGAA 
GTACTGAGAT 
TACTTAAAAG 
AAC TGTATTG 
GTATTTCTGT 
CTTTATTATG 
GTTTGGTCAA 
CTGGTTTGAC 
CTATTGTTTA 
TTCTGCCCTA 
TAGCAACAGA 
GGTTCAATCA 



AGTTG TTT AT 50 

CGCTAGAATT 100 

GATGGTTTAC 150 

TGTACCAGTG 2 00 

ATACGATCGA 250 

ATC C ATAAAA 300 

CCTTGAAACT 3 50 

GTGGGAAAGT 400 

ATCC AAGAAT 450 

ATTTACTGGT 5 00 

AAATGAAAGA 550 

ATGAACGAAC 600 

GATTGCTGAA 650 

TTG ATAATAT 7 00 

TTAGGTCGTA 750 

AATGGGTCAA 800 

CTTCA 845 



2 ) INFORMATION FOR SEQ ID NO : 291 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Enterococcus faecalls 

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 

45 TTAGATCAAT CCTTACCCGA TATTAACAAC GCTTTAGTCG TTTATAAAAA 50 

TGGCGAAGCA AAACAAAAAG TAGTACTTGA AGTCGC TTTA GAACTAGGTG 100 

ATGGAGTGAT TCGTTCTATC GCCATGGAAT CGACAGATGG TTTACAACGT 150 

GGAATGGAAG TTATCGATAC AGGAAAATCA ATTTCAGTTC C TGTTGGT AA 200 

AGATACATTA GGTCGTGTGT TTAACGTTTT AGGAGACACA ATTGACTTAG 250 

50 AAGCGCCATT CCCTGCAGAT GCTGAACGTA GTGGGATTCA TAAAAAAGCG 300 

CCAGCATTTG ATGAATTAAG T AC C AGTAAT GAAATTTTAG AAACAGGGAT 3 50 

TAAAGTTATT GACTTATTAG CACCTTATCT AAAAGGTGGT AAAGTCGGAC 400 

TTTTCGGTGG TGCCGGTGTT GGTAAAACCG TCTTAATTCA AGAATTAATT 450 

CATAATATTG CCCAAGAACA TGGAGGGATT TCCGTCTTTA CTGGTGTTGG 500 

5 5 TG AAC GG AC A CGTGAAGGGA ACGATCTGTA CTATGAAATG AAAGATTCAG 550 

GCGTTATTGA AAAAACAGCC ATGGTTTTTG GTCAAATGAA CGAACCGCCA 600 

GGTGCACGGA TGCGTGTGGC CTTAACTGGG TTAACGATTG CTGAATATTT 650 

CCGTGATGTG GAAGGACAAG ACGTGCTATT ATTTATTGAT AAC ATTTTC C 7 00 

GTTTCACCCA AGCCGGTTCA GAAGTTTCTG CCCTTTTAGG TCGGATGCCG 750 

60 TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800 

377 
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AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 840 



5 2 ) INFORMATION FOR SEQ ID NO: 292 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



20 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 



TTTCTTTAGA CCHKTCcrrA LCrt^TWTc/i ncnn^vj^ni i ~w xxOxj-j.^-. -XX 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGAC GG A GTGATCC GGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 200 

25 GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTACTAGGAG ATACGATCGA 250 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 3 00 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 350 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 400 

CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGT AC TG ATCCAAGAAC 45 0 

30 TGATCCATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 65 0 

TATTTCCGTG ATGTAGAAGG TCAAGATGTA C TGTTGTTTA TC G AC AAC AT 7 00 

35 TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 750 

TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 800 

TTGCAAGAAC GTATCACATC TACGAAAAAA G 83 1 



2 ) INFORMATION FOR SEQ ID NO: 293 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 6 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

CCTTACCAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 50 

AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT 100 

CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGAATGGAAG 150 

60 TCATCGATAC CGGTGCCTCA ATCTCTGTCC CTGTAGGAAC AGATACTCTA 200 

378 
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ACTTCTACGA AAAAAGGATC TGTAAC 

15 



250 
300 
350 
400 
450 



^^GAGTAT TTAATGTACT AGGCGATACT ATCGACTTGG AAGCACCATT 
CCCAGAAGAT GCCAAACGTA GTGGCATCCA CAAAAAAGCC CCMATTTCG 
iTS^ TTGTC AACAAGTACA GAAATCCTTG AAACTGGGAT CAA^gStc 

gatttattag ctccttactt aaaaggtggt aaagtcggct ^gttcggJS 

5 TGCCGGTGTT GGTAAAACCG TATTGATTCA AGAATTGATT 

CTCAAGAGCA TGGGGGAATT TCAGTATTTA CCGGTGTTGG CGAACgS^G 

CGTGAAGGTA ATGACTTGTA TTATGAAATG AAAGAATCAG GCGTTA^cS Hn 

AAAGACAGCC ATGGTTTTCG GTCAAATGAA TGAACCACCA GGTGCCCGGA Inn 

1 n J?^ GTTGC TTTGACTGGT TTGACCATTG CTGAGTATTT SSSSSS «S 

GAA ? GA S^S ATGTGCTCTT GTTTATCGAT AATATTTTCC GTTTCaScI 700 

™ G S GTTCT GAAGTATCTG CCTTGTTAGG CCGGATGCCA TCAGCCGTTG l^n 

S USSSSSt GTCAATTACA ssssss sss 



826 



2 ) INFORMATION FOR SEQ ID NO: 294 

( i ) SEQUENCE CHARACTERISTICS • 
20 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 
3Q (B) STRAIN: ATCC 43076 y " CUfl 

(xi) SEQUENCE DESCRIPTION: _ SEQ ID NO: 294 

ITaSoSS- CCAATCCTTA CCAGACATCA ACAATGCGTT GGTGGTCTAT ^ 
_ AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAgSc"£ inn 

T GGTGATGGT gtc gtacgta cgattgctat gtcgtcaaca gacggSt^S i 2n 
AACG ^ G 2 GAT ggaagtcatc gatacaggag catcaatttc tgttStgS? Inn 
^gaaagaga cattaggacg tgtatttaac gttttagggg agac5atcgI oln 

CCATTCCCAG AAGAAGTAGC ACGAGATGGT A^TCA^AAA^ III 

An AG S CA £ CTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA \ln 

40 GGGATTAAAG TAATCGATTT ATTAGCGCCT TACTTAAAAG GTGOraAari 11 r, 

TGGTTTATTP rr"rrr"TVf-rv-. ^mom^XXZ, iaaaa(j CaTGGGAAAGT 400 

m^mi., ,i GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT Ac.n 

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTCCrS tnn 

2™™*°° GTACTCGTGA AGGAAATGAC cSEESSS aIStG^KgT tin 

* = G T CGGGCGTT attgagaaaa cagcgatggt ttttggacaa atcaSgIa"? 

45 ^ccaggtgc acgtatgcga gttgctttaa ctggtttS? cStcSgIa t^n 

Hips SSSSSS SSSSKZS sssg™ illi 



2 ) INFORMATION FOR SEQ ID NO : 2 95 

( i ) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



379 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 
5 (B) STRAIN: ATCC 3 5469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

10 TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGC GTACCA 100 

TCGCAATGGG GTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 2 00 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 2 50 

AAGAGC GTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 3 00 

15 AACTCTCAGG AACTGC TGG A AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGACTCCAA CGTTATCGAC AAAGTATCCC 550 

2 0 TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTG AC AAC A TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800 

25 TGG 803 



2 ) INFORMATION FOR SEQ ID NO: 296 

30 

(i) SEQUENCE CHARACTERISTICS : 
( A") "^"LENGTH r "822 bases" 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 

45 GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50 

GCGTCTGGTG CTGGAAGTGC AGCAGCAGCT CGGCGGCGGT ATCGTGCGTA 100 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 150 

GACCTCGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTGCTGGGTC AGCCGATCGA CATGAAAGGC GATATCGGTG 25 0 

50 AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

55 TAACG AC TTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 65 0 

CGTTCTGTTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACTG 7 00 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

60 ACCCTGGCGG AAGAGATGGG CGTTC TGC AG GAGCGTATCA CCTCCACCAA 800 
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AACCGGTTCT ATCACCTCCG TA 822 



5 2 ) INFORMATION FOR SEQ ID NO: 297 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33821 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 



CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AGCGTCTGGT 5 0 

GCTGGAAGTT CAGCAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

TGGGTTCTTC CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 15 0 

CACCCGATCG AAGTGCCGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 2 00 

25 CGTACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 2 50 

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 3 00 

CAGGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTC CGTT 35 0 

CGCCAAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 400 

CCGTAAACAT GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT 450 

3 0 TACTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500 

CTACCACGAG,ATGACCGACT_CCAACGTTCT GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 600 

GGCCTGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 650 

GTTCGTTGAC AACATCTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 7 00 

35 CACTGCTGGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 8 00 

TATCACCT 808 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 298 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium lentum 

(B) STRAIN: ATCC 43055 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 

TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 50 

CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTCGAGGTC GAGACGCACC 100 

TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAG CTCGAC GGACGGTCTC 15 0 

60 GTCCGCGGCC TCGAGGTCGT CGACACGGGC AACCCGATCA TGATGCCCGT 200 
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GGGTCCCGAG ACCCTGGGTC GCATCTGGAA CGTCATGGGC GAGCCCGTCG 2 50 

ACGAGAAGCC GATGCCCGAG GTGAAGGGCT ACATGCCCAT CCACCGTCCG 3 00 

GCTCCGGACT ACGACGAGCT GTCCACCACC ACCGAGATCT TCGAGACCGG 350 

CATCAAGGCC ATCGACCTCG TCGAGCCCTT CGTCAAGGGC GGCAAGACGG 400 

5 GTCTGTTCGG CGGCGCCGGC GTGGGCAAGA CGGTTATCAT CCAGGAGCTC 450 

ATCAACAACC TGGCCCAGGA GCACGGCGGC ACGTCGGTGT TCACGGGCGT 500 

GGGCGAGCGT ACCCGCGAGG GTACCGACCT CTACCTGGAG ATGAGCGACT 550 

CGGGCGTCAT CAACAAGACC TGCCTCGTGT ACGGTCAGAT GAACGAGCCT 600 

CCGGGAGCGC GTCTGCGCGT GGGTCTCGCG GGCCTCACCG AGGCGGAG T A 650 

10 CTTCCGCGAT CAGGGCCAGG ACGTGCTTCT GTTCGTGGAC AACATCTTCC 7 00 

GCTTCACGCA GGCCGGCTCC GAGGTGTCCG CTCTGCTGGG CCGCATGCCC 750 

TCTGCCGTGG GTTACCAGCC GACGCTGGCA ACCGAGATGG GCGACCTGCA 800 

GGAGCGCATC ACGTCGACGT CCACCGGCTC CATCACGTCC GTG 843 

15 

2 ) INFORMATION FOR SEQ ID NO: 299 

( i ) SEQUENCE CHARAC TER I STICS : 
2 0 (A) LENGTH: 829 bases 

(5) TYPE t Ivaeleie acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ewingella americana 

(B) STRAIN: ATCC 33852 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC 50 

GGTGCCTCCA AACTGGTTCT GGAAGTTCAG CAACAGTTAG GCGGCGGCGT 100 

35 TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 150 

AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 2 00 

CTGGGTCGTA TCATGAACGT ATTGGGTGAA CCAATCGACA TGAAAGGTGA 2 50 

AATCGGCGAA GAAGAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG 300 

AAGAGCTGGC TAACTCCCAA GAATTGCTGG AAACC GGT AT CAAAGTTATG 3 50 

40 GACCTGATGT GTCCGTTCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 400 

TGCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGTAACATCG 450 

CGATCGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCGA 550 

CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA GGTAACCGTC 600 

45 TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 650 

GGTCGTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCCTGGC 7 00 

AGGTACCGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGGTAGGTT 7 50 

ATCAGCCAAC GCTGGCGGAA GAGATGGGTG CTCTGCAAGA GCGTATCACC 800 

TCTACCAAAA GTGGTTC TAT CACCTCCGT 829 

50 

2 ) INFORMATION FOR SEQ ID NO: 3 00 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 



60 



382 



CA 02307010 2000-05-19 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 
5 (B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 

AACACGCCTA AAGTATATGA TGCTTTAAAT GTAGTAGAAG CTGGTTTAGT 50 

10 ATTAGAAGTT CAGCAACAAA TTGGTGATGG CGTAGTTCGT ACAATTGCTA 10 0 

TGGGATCTAG TGATGGTCTT AGACGTGGTA TGGAAGTTAA GAACACAAAT 150 

GCGCCTATTT CTGTTCCAGT TGGACATGGC ACACTTGGAC GTATCATGAA 2 00 

TGTTTTAGGT GAACCAATTG ATGAAGCTGG TCCAATTGAA TATACTGAGA 25 0 

AAAGATCTAT CCATCAAGCT CCTCCTGCAT ATGATGAGTT AGCATTAAGT 300 

15 ACAGAAATAT TAGAAACAGG TATCAAAGTA GTTGACCTTA TTTGTC C ATT 35 0 

TGCTAAGGGC GGTAAAGTTG GTTTATTTGG CGGTGCAGGT GTTGGTAAAA 400 

CTGTAACGAT GATGGAACTT ATCAACAATA TTGCAAAAGA ACATAGTGGC 450 

T AC TCTGTAT TTTCCGGTGT TGGTGAAAGA ACTCGTGAAG GTAATGACTT 500 

CTACTATGAG ATGAAATATT CTAATGTATT GGATAAAGTA TCATTAGTAT 55 0 

20 ATGGTCAGAT GAATGAGCCG CCTGGAAACA GATTAAGAGT AGCTCTTAGT 600 

GGCTTAACAA TAGCAGAAGG ATTC CGTGAT GAAAAGCGTG ATGTTTTGAT 650 

GTTTATC GAT AACATCTATC GTTATACATT AGCAGGTACA GAGGTATCGG 7 00 

CGCTACTTGG TCGTATGCCA TCTGCTGTGG GTTATCAGCC AACGC TTGC A 75 0 

GCTGAGATGG GTGCTTTACA GGAGCGTATT ACATCTACTA AGACAGGATC 800 

2 5 TATTA 805 



2 ) INFORMATION FOR SEQ ID NO : 3 01 

30 

( i ) SEQUENCE CHARACTERISTICS : 

(A) * LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Fusobacterium gonidiaformans 

(B) STRAIN: ATCC 25563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 

45 GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50 

AGAACTTGTA TTGGAAGTGC AACAACATTT GGG AAATAAT GTTGTGAGAA 100 

C AG TAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 150 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGG AAGGCGG TTTTAGGAAG 2 00 

AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 2 50 

50 CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 300 

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 350 

AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAGCCGGAG 400 

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 4 50 

CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 5 00 

55 AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550 

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 65 0 

AGATGTATTG TTGTTTATCG AC AATATC TT CCGTTTCACA CAAGCAGGAT 700 

CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 7 50 

60 CCGAACTTAG CGACAGAAAT GGG AACTTTA CAAGAAAGAA TTACTTCTAC 800 
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AAAATCAGGA TCTATCACTT CGGTA 825 



5 2 ) INFORMATION FOR SEQ ID NO: 302 

{ i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorum subsp. n e croph or urn 

(B) STRAIN: ATCC 25286 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 



ACAATGCATT AAAGGTACAG GTGGGAGAAA GGG AACTTGT GTTGGAAGTG 50 

CAGCAACATT TAGGAAATAA TGTTGxCAGA ACAGTAGCAA TGGATTCAAC 100 

AG AC GG ATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 15 0 

CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 2 00 

25 GAGCCTGTGG ACGAAAAAGG TC C GAT AG AG ACAGAAGAAT ATTTACCAAT 250 

ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG ACAGAAATTT 3 00 

TTGAAACAGG AATTAAAGTC AT TG ATTTGT TAGCTCCTTA TATTAAAGGA 3 50 

GG AAAAAC AG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 4 00 

TATGGAAC TG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 450 

3 0 TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA 500 

1 ■ ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT 550 

GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 

TTGCCGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT 650 

GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 7 00 

3 5 GGGGAGAATT CCTTCTGCAG TGGGATATCA ACCGAACTTG GCGACAGAAA 7 50 

TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 800 

TCCGTG 806 



40 



2 ) INFORMATION FOR SEQ ID NO: 3 03 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 821 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp. polymorphism 

(B) STRAIN: ATCC 10953 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 

GATGAATTGC CTGCAATATA TAATGC TTTA AAAGTAAAAT TAGAAGATAA 50 

GGAACTTGTT CTAGAAGTTG AACAACATCT TGGTAACAAT GTTGTAAGAA 100 

CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGTTATA 15 0 

60 GATACAGGTA AACCAATTAC AGTAC C AGTT GGTAAAGCTG TTCTTGGTAG 2 00 
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AATATTAAAT GTTTTAGGAG AACCTGTTGA TAATCAAGGT CCTATAAATG 2 50 

CTGAAACATT TTTACCTATT CATAGAGAAG CACCAGAATT TGATGACTTA 3 00 

GAAACTGAAA CTGAAATATT TGAAACAGGA ATAAAAGTTA TAG AC TT ATT 3 50 

AGCACCATAT ATTAAAGGTG GAAAAATAGG ATTATTTGGT GGAGCTGGAG 4 00 

5 TAGGAAAAAC AGTTTTAATA ATGGAACTTA TCAACAACAT TGCAAAAGGA 450 

CATGGAGGAA TTTCAGTTTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

TAGAGACTTA TATGGTGAAA TGACTGAATC AGGAGTTATC ACAAAAACAG 550 

CTC TTGTTT A TGGACAAATG AATGAGCCAC CTGGAGCAAG ACTTAGAGTT 600 

GCATTAACAG GGCTTACTGT TGCAGAAAAC TTTAGAGATA AAGATGGGCA 65 0 

10 AGATGTTCTT CTATTTATAG ATAATATATT TAGATTTACA CAAGCAGGTT 700 

CAGAAGTTTC AGCTTTACTT GGAAGAATAC CATCAGCTGT TGGATATCAA 75 0 

CCAAACCTAG CAACTGAAAT GGGTGCTTTA CAAGAAAGAA TAACATCTAC 800 

AAAATCTGGT TCAATTACAT C 821 

15 

2 ) INFORMATION FOR SEQ ID NO: 3 04 

( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 

(B) STRAIN: ATCC 49145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 



30 



TTCCCAGTTG GCTATCTTCC AGATATTTAT AATGCTCTCA AGGTTGATAT 5 0 

CAACACCGTT GGAAACACGG AGGGAGATAC CGTCCACGAG ATTACATTGG 100 

35 AAGTTGAGCA GCACCTTGGT GATTCAACTG TGCGAGCAGT GGCACTTAAG 150 

CCTACGGACG GCTTGGTCCG TGGTGC TTT A GTGCGAGATA CTGGTGGCCC 2 00 

AATTTC TGTG CCTGTTGGAG ATGTTACAAA AGGTCACGTT TTTGACGTAA 250 

CTGGTAACAT TTT AAACGC T AAACCAGGCG AAAACATTGA GGTGACCGAG 3 00 

CGCTGGCCAA TCCACCGCAA CCCACCTGCT TTCGATCAGC TTGAGTCTAA 3 50 

40 GACTCAAATG TTTGAAACAG GCATTAAGGT TATC GATTTG C TT AC GC C TT 40 0 

ACGTTCAGGG CGGAAAGATT GGTCTGTTCG GTGGTGCAGG CGTTGGTAAA 45 0 

AC TGTGTTGA TTCAGGAGAT GATTCAGCGC GTTGCACAGA ACCACGGCGG 500 

TGTGTCTGTG TTTGCTGGCG TTGGCGAACG TACTCGTGAG GGTAACGATT 55 0 

TGATTGGCGA AATGGCTGAG GCTGGCGTTT TGGAGAAAAC AGCGC TTGTC 600 

45 TTTGGTCAGA TGGATGAGCC TCCTGGGACT CGTCTTCGTG TGCCTCTTAC 65 0 

TGCTTTGACT ATGGC TGAGT ATTTCCGTGA TGTTCAGAAT CAGGATGTGT 700 

TGCTGTTTAT CGACAACATC TTCCGCTTTA CTCAGGCAGG TTCTGAGGTT 750 

TCCACGTTGC TTGGTCGTAT GCCTTCTGCA GTTGGTTATC AGCCAAACTT 800 

GGCGGATGAA ATGGGTGCGT TGCAGGAGCG CATTACTTCT ACGCGCGGTC 850 

50 ATTC TATT AC GTCG 864 



2 ) INFORMATION FOR SEQ ID NO : 305 

55 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 

10 TCGAATCAGG GCATATGCCA AATCTATTAA ACGCTTTAGA AGTTTACATA 50 

GAAAAAGGCG ATGGGAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATTGGTGAT AACGTAGTAA GAACAATCGC TATGTCATCT ACTGATGGAT 150 

TAAATAGGGG AGCAGAAGTA GTAGATACAG GAGCACCAAT TACAGTTCCT 2 00 

GTAGGTAACT ACACATTAGG TCGTGTGTTC AACGTATTAG GTGAAGCAGT 2 50 

15 TGACCACGGT GAAGAAGCAG GAGCAGAAGT TCG TAAAG AT TCAATTCACA 3 00 

AAGAAGCTCC AACATTCGAT GAATTATCAA CTCACGTTGA GGTTCTTGAA 350 

ACAGGTATTA AAGTTATCGA CTTACTTGCA CC AT AT ATT A AAGGTGGTAA 400 

AATCGGTCTT TTCGGTGGTG CGGGAGTTGG TAAAACGGTT CTTATCCAAG 450 

AACTTATCAA CAACGTTGCG CAACAACACG GTGGATTATC AGTATTCACA 5 00 

2 0 GGTGTAGGTG AGCGTACTCG TGAAGGAAAT GACTTATACT ATGAAATGAA 550 

AGATTCTGGT GTTATTAACA AAACAGCCAT GGTATTCGGA CAAATGAACG 600 

AACCACCAGG TGCTCGTATG CGTGTAGCAT TAACAGGATT AACAATGGCG 650 

GAATACTTCC GTGATGAAGA AGGACAAGAC GTGCTTCTAT TCATCGATAA 7 00 

CATTTTCCGT TTCACACAAG CAGGTTCTGA GGTTTCTGCG TTATTAGGAC 7 50 

25 GTATGCCATC AGC CGTTGGT TACCAACCAA CACTTGCTAC AGAGATGGGA 800 

CGTTTACAAG AACGTATAAC ATCAACTAAA AAAGGTTCTG TTACATCT 848 



30 2 ) INFORMATION FOR SEQ ID NO: 306 



45 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 

3 5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 306 



TCGAATCAGG GCATATGCCT AATCTAC TAA ACGCTTTAGA AGTTTATATA 50 

GAAAAAGGCG ATGGAAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATCGGGGAT AATGTCGTAA GAACTATTGC GATGTCATCT ACTGATGGAT 150 

TAAACAGAGG GGCAGAAGTA GTTGATACTG GAGCGCCAAT TACAGTGCCA 200 

50 GTAGGTAACT ATACATTAGG ACGTGTGTTC AACGTATTAG GTGAAGCAGT 25 0 

TGACCACGGA GAAGAAGCTG GAGCAGAAGT TCAAAAAGAA TCTATTCATA 3 00 

AAGAAGCTCC AACTTTCGAA GAATTATCAA CACATGTTGA GGTATTAGAA 350 

ACAGGTATTA AAGTTATCGA CCTTCTTGCA C C AT ATATTA AAGGTGGTAA 4 00 

GATTGGACTA TTCGGTGGTG CTGGAGTTGG GAAAACAGTT CTTATCCAAG 450 

55 AACTTATTAA CAACGTAGCA CAACAACACG GAGGACTTTC AGTATTTACT 500 

GGGGTAGGTG AACGTACTCG TGAGGGTAAC GACTTGTACT ATGAAATGAA 550 

AGACTCTGGA GTTATTAATA AAACTGCCAT GGTATTTGGT CAAATGAATG 600 

AGCCACCAGG TGCACGTATG CGTGTTGCCT TAACAGGATT AACAATGGCA 65 0 

GAGTACTTCC GTGATGAAGA AGGACAAGAC GTAC T ATT AT TTATCGATAA 7 00 

60 TATCTTCCGT TTCACACAAG CAGGGTCTGA GGTATCTGCA TTATTAGGGC 7 50 

386 



CA 02307010 2000-05-19 



15 



40 



55 



GTATGCCTTC AGCCGTTGGA TATCAACCAA CTCTTGCAAC AGAAATGGGA 8 00 

CGTCTTCAAG AACGTATTAC ATCAACTAAA AAAGGATCTG TTACATCT 848 



2 ) INFORMATION FOR SEQ ID NO: 307 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 813 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 

GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

TAGGTACCTC AGATGGTTTA AAGC GTAGCT TAAAGGTTGT AAATACAGGT 150 

2 5 AACCC TATTC AAGTTCCTGT AGGCACTAAA ACATTAGGCC GTATTATGAA 200 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 25 0 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 3 00 

ACTGAATTAC TTGAAACCGG TATCAAAGTT ATTGACTTAA TTTGTCCATT 3 50 

TGCAAAAGGT GGTAAAGTCG GCTTATTTGG TGGAGCTGGT GTAGGTAAAA 400 

3 0 C CGTTAAT AT GATGGAATTG ATCCGTAATA TTGCTATTGA GCACTCAGGT 450 

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT 5 00 

TTATCATGAA ATG AC GG ATT CTAATGTATT AGATAAAGTA TC AC TAGT AT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 600 

GGTTTAACTA TGGCTGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 65 0 

35 TTTCGTAGAT AATATTTATC GTTATACTTT AGCCGGTACA GAAGTTTCTG 7 00 

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 7 50 

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 8 00 

AATCACGGCA GTA 813 



2) INFORMATION FOR SEQ ID NO: 308 



( i ) SEQUENCE CHARACTERISTICS : 
45. (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 

(B) STRAIN: ATCC 33390 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 



TGAATTTCCA CAAGATGCAG TGCCAAAAGT TTAC GATGC A TTAAAAGTTG 50 

AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 

60 CGTTGTATCG CATTAGGTGC TTCTGACGGT TTAAAACGTG GTTTAAAAGT 150 
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AGAAAACACG AATGATCCGA TTCAAGTACC GGTAGGCACA AAAACCCTTG 2 00 

GTCGTATCAT GAATGTATTG GGTGAACCAA TTGACGAACA AGGTCCAATC 2 50 

GGTGAAGAAG AGCGTTGGGC TATCCATCGT TCTGCACCAA GCTATGAAGA 3 00 

ACAATCAAAC AGTACGGAAT TATTAGAGAC TGGTATCAAA GTTATCGACT 3 50 

5 TAATTTGTCC ATTCGCAAAA GGTGGTAAAG TTGGTCTATT CGGTGGTGCG 400 

GGTGTAGGTA AAACCGTTAA CATGATGGAA TTAATCCGTA ACATCGCGAT 450 

CGAGCACTCA GGTTACTCCG TATTTGCGGG TGTAGGTGAA CGTACTCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGAAAG ATTCTAACGT ATTAGATAAA 550 

GTATCTTTGG TTTATGGTCA GATGAATGAG CCACCAGGTA ACCGTTTACG 600 
10 TGTTGCGTTA ACTGGTTTAA CCATGGCAGA AAAATTC CGC GATGAAGGTC . 65 0 

GTGATGTATT ATTCTTCGTG GATAATATCT ATCGTTATAC CCTTGCTGGT 7 00 

ACGGAAGTAT CTGCGTTATT AGGTCGTATG CCATCTGCGG TAGGTTACCA 7 50 

ACCAACTCTT GCTGAAGAAA TGGGTGTGTT ACAAGAACGT ATCACTTCAA 800 

CCAAAACAGG TTCTATTACA TCTGTA 826 

15 

2 ) INFORMATION FOR SEQ ID NO; 309 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parahaemolyticus 
30 (B) STRAIN: ATCC 10014 



" (xi) SEQUENCE DESCRIPTION:" SEQ ID NO": 309 

GATGCAGTAC CAAAAGTATA TGATGCGTTA AAAGTTGAAT CAGGTTTAAC 50 

35 GCTTGAAGTT CAACAACAAT TAGGCGGTGG CTTAGTGCGC TGTATCGCAT 100 

TAGGTACGTC TGATGGTTTA AAACGTGGCT TAAAAGTAGA AAATACAGGC 150 

AACCCAATTG AAGTGCCAGT GGGCACTAAA ACCCTTGGTC GTATTATGAA 200 

CGTATTGGGT GAGCCGATTG AC G AAAAAGG TCCTATCGGT GAAGAAGCAC 250 

GCTGGGCAAT CCACCGTGCA GCACCAAGCT ACGAAGAGCA ATCAAATAGC 3 00 

40 ACGGAATTAC TCGAAACAGG TATCAAAGTT ATCGACTTAA TCTGCCCATT 35 0 

CGCAAAAGGG GGTAAAGTTG GTTTATTTGG TGGTGCAGGT GTAGGTAAAA 400 

CCGTAAATAT GATGGAGTTA ATCCGTAACA TCGCGATCGA ACACTCTGGT 450 

TACTCTGTAT TTGCAGGGGT AGGTGAGCGT ACTCGTGAAG GTAATGACTT 500 

CTACCACGAA ATGACAGACT CTAACGTATT AGATAAAGTA TCGTTAGTGT 550 

45 ATGGTCAAAT GAACGAACCA CCAGGTAACC GTTTACGCGT AGC TTTAAC A 600 

GGCTTAACCA TGGCGGAAAA ATTCCGCGAT GAAGGTCGTG ACGTATTATT 650 

CTTCGTCGAT AACATCTACC GTTATACCCT AGCAGGTACG GAAGTGTCAG 700 

CACTTCTCGG TCGTATGCCA TCTGCGGTAG GTTATCAGCC AACCTTAGCA 75 0 

GAAGAAATGG GTGTATTACA AGAGCGTATC ACTTCAACCA AAACTGGTTC 800 

50 TATCACCTC 809 



55 



2 ) INFORMATION FOR SEQ ID NO : 310 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Haemophilus parainfluenzas 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 

10 CGAATTTCCA CAAGATGCAG TACCAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG TGGTGTGGTA 100 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 2 00 

GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 2 50 

15 GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 3 00 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTATCAAA GTTATCGACT 3 50 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 400 

GGTGTAGGTA AGACCGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 450 

TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 5 00 

2 0 AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 55 0 

GTATCCCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 600 

TGTTGCTTTA ACCGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 65 0 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 7 00 

ACTGAAGTAT CGGCACTTTT AGGC CGTATG CCATCAGCGG TAGGTT ATC A 7 50 

25 GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATTACATCAA 800 

CCAAAACAGG TTC TATT ACT TCTG 824 



30 2 ) INFORMATION FOR SEQ ID NO: 311 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 311 



GCCGTGCCTA AAGTGTATAA CGCACTTGAG GTGAAAGGCG GTGCCACTAA 50 

ACTGGTACTG GAAGTTCAGC AGCAGCTAGG CGGCGGCGTT GTACGCTGTA 100 

TCGCTATGGG TACTTCTGAC GGTCTGCGTC GCGGACTGGA CGTTGTTGAC 150 

CTGGAGCACC CGATTGAAGT CCCAGTAGGT AAAGCGACCT TAGGCCGCAT 200 

50 TATGAACGTA CTGGGTGAGC CAATTGATAT GAAGGGTGAT ATCGGCGAAG 250 

AAGATCGCTG GGCTATTCAC CGTGAAGCTC CAAGCTACGA AGAACTGTCT 300 

AACTCGCAAG AACTGCTGGA AACTGGTATC AAGGTAATGG ACCTGATTTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

55 TCAGGTTACT CTGTATTTGC CGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTATTGGAC AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACTATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC ATCGATAACA TCTACCGTTA TACCTTGGCC GGTACCGAAG 7 00 

60 TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 750 
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15 



40 



55 



CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC 800 
GGGTTCAATC A 811 



2 ) INFORMATION FOR SEQ ID NO: 312 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella klngae 

(B) STRAIN: ATCC 23330 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 

GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC 5 0 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGGCGTAG TGCGTACCAT 100 

TGCGATGGGT AGTAC CGATG GTTTGAAACG CGGC TTAGCC GTGAACAACA 150 

25 CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 2 00 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 2 50 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 3 00 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 3 50 

CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 400 

30 CAAAACTGTG AACATGATGG AGTTGATTAA CAACATTGCC AAAGCGCACA 450 

GTGGTTTGTC "TGTATTTGC A GGCGTGGGTG AACGTACTCG CGAAGGTAAT 500 

GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGC CAT 550 

GGTGTATGGT CAAATGAATG AACCTCCTGG CAACCGTTTG CGCGTTGCAT 600 

TG AC TGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AG AC G AAAAT 650 

35 GGCAAAGGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 7 00 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 750 

TAGGTTATCA ACCAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACTTCAA CGCAAACAGG TTCG ATT AC T T 831 



2 ) INFORMATION FOR SEQ ID NO : 313 



( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp, ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 



ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATGAA 5 0 

GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGTA TCGTACGTAC 100 

60 CATCGCCATG GGTTCTTCTG ATGGTCTGCG CCGCGGTCTG GATGTAAAAG 15 0 

390 
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ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT 200 

ATCATGAACG TACTGGGTCA ACCGGTTGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCGGC ACCGTCCTAT GAAGAGCTGT 300 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350 

5 TGTCCGTTCG CCAAGGGCGG TAAAGTTGGT CTGTTCGGCG GTGCGGGTGT 400 

AGGTAAAACT GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ATAAAGTATC 550 

CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

10 CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACTGA 700 

AGTATCCGCG CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 

15 



2 ) INFORMATION FOR SEQ ID NO: 314 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella omithinolytica 
3 0 (B) STRAIN : ATCC 31898 

— -{-xi ) SEQUENCE— DESCRIPTION:™ SEQ ~rD~NO :~ 314 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50 

35 AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 250 

AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 3 00 

40 CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAAC GT AC TCGTGAGGGT 5 00 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 55 0 

45 CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 65 0 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TAT AC C CTGG CCGGTACTGA 700 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

50 ACCGGTTCTA TC 812 



55 



2) INFORMATION FOR SEQ ID NO : 315 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 

10 GTACCGCGCG TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT CGTACCATCG 100 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC 150 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 200 

GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGACATC GGCGAAGAAG 25 0 

15 AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 300 

TCTCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 350 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 400 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

20 CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGATAAA GTATCCCTGG 550 

TG TATGGCC A GATGAACGAG CCGCCGGGAA ACCGTCTGCG CG^TGCGCTG 6GG 

AGeSSeeTGA CCATGGCTGA GAAGTTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTC GATAACATCT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT 7 00 

CCGCACTGCT GGGTC GTATG CCTTCAGCGG TAGGTTACCA GCCGACTCTG 750 

25 GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACGGG 800 

TTCTATC AC T TCC 813 



30 2 ) INFORMATION FOR SEQ ID NO: 316 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

3 5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella plant i col a 

(B) STRAIN: ATCC 33531 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 



GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA " 50 

GAGCCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 100 

CTATCGCCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 200 

50 TATCATGAAC GTGCTGGGTC AGCCGATCGA TATGAAAGGC GACATCGGCG 25 0 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGAGCTG 3 00 

TCCAGTTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 350 

GTGTCCGTTC GCTAAGGGCG GTAAAGTAGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

55 CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTCTG GATAAAGTAT 55 0 

CCCTGGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCTCTGACCG GCCTGACCAT GGC TGAG AAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGATA AC ATCT ATCG TTATAC CCTG GCCGGTACTG 7 00 

60 AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTATCAGCCG 7 50 
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ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CCTCCACCAA 800 
AACCGGTTCT ATCACTTCCG TA 822 



2 ) INFORMATION FOR SEQ ID NO: 317 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 785 bases 
10 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 

AGAATGGTAA TGAAGTTCTG GTGCTGGAAG TTCAGCAGCA GCTGGGCGGC 50 

GGTATCGTAC GTACCATCGC CATGGGTTCT TCTGATGGTC TGCGCCGCGG 100 

TCTGGATGTA AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 150 

25 CAACGCTGGG TCGTATCATG AACGT AC TGG GTCAACCGGT TGACATGAAA 2 00 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGCG CGGCACCGTC 2 50 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 00 

TTATCGACCT GATGTGTCCG TTCGCCAAGG GCGGTAAAGT TGGTCTGTTC 3 50 

GGCGGTGCGG GTGTAGGTAA AAC TGTAAAC ATGATGGAGC TGATCCGTAA 400 

3 0 CATCGCGATC GAGCACTCCG GTTACTCTGT GTTTGCGGGC GTAGGTGAGC 45 0 

GTACTCGTGA GGGTAATGAC TTCTACCACG AAATGACCGA CTCCAACGTT 5 00 

ATCGATAAAG TATCCCTGGT GTACGGCCAG ATGAACGAGC CGCCGGGAAA 550 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAATTCCGTG 600 

ACGAAGGTCG TGACGTACTG CTGTTCGTCG ATAACATCTA TCGTTACACC 650 

3 5 CTGGCCGGTA CTGAAGTATC CGCACTGCTG GGTCGTATGC CTTCAGCGGT 7 00 

AGGTTATCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 750 

TCACCTCCAC CAAAACCGGT TCTATCACCT CCGTA 7 85 

40 

2 ) INFORMATION FOR SEQ ID NO : 318 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 759 bases 
45 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 318 

C TGGTGC TGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWCCAT 50 

CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 

TCGAGCACCC AATCGAAGTT C CGGTMGGTA AAGCAACMCT GGGTCGTATC 150 

60 ATGAACGTAC TGGGTCAKCC AGTMGACATG AAAGGCGACA TCGGTGAAGA 2 00 
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AGAGCGTTGG GCTATCCACC GCGCTGCACC 
GCTCTCAGGA ATTGCTGGAA ACCGGTATCA 
CCGTTCGCTA AGGGCGGTAA AGTCGGTCTG 
TAAAACCGTA AACATGATGG AGCTGATCCG 
5 CCGGTTACTC CGTGTTTGCG GGCGTAGGTG 
GACTTCTACC ACGAAATGAC CGACTCCAAC 
GGTATATGGC CAGATGAACG AGCCACCGGG 
TGACCGGTCT GACCATGGCT GAGAAATTCC 
CTGCTGTTCG TCGATAACAT CTATCGTTAC 
10 ATCTGCWCTG CTGGGTCGTA TGCCTTCAGC 
TGGCGGAAGA GATGGGCGTT CTGCAGGAAC 
GGTTCTATC 



TTCCTACGAA GAGCTGTCTA 250 

AAGTTATCGA CCTGATGTGT 3 00 

TTCGGTGGTG CSGGTGTTGG 350 

TAACATCGCG ATCGAGCACT 400 

AACGTACTCG TGAGGGTAAC 450 

GTTATCGATA AAGTATCCCT 500 

AAACCGTCTG CGCGTTGCTC 550 

GTGACGAAGG TCGTGACGTA 600 

ACCCTGGCCG GTACTGAAGT 65 0 

GGTAGGTTAC CAGCCGACCC 7 00 

GTATCACCTC CACCAAGACC 7 50 

759 



2 ) INFORMATION FOR SEQ ID NO: 319 

{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
20 (B) TYPE: Nucleic acid 

(C) .STRANDEDNESS i Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

3 0 ( xi) S EQUENCE DESCRIPTION: SEQ ID NO : 319 

TTCCCTCAGG ATGCCGTACC GCGTGTGTAC GAAGCCCTTG AGGTTCAGAA 50 

TGGTAATGAA GTGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCTATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

35 GATGTAAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

ACTGGGTCGT ATCATGAACG TACTGGGCCA ACCGGTAGAC ATGAAAGGCG 25 0 

ACATCGGTGA AGAAGAACGT TGGGCTATCC ACCGTGCAGC ACCTTCCTAC 300 

GAAGAGCTGT CAAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTTG CGAAGGGCGG TAAAGTTGGT CTGTTCGGTG 400 

40 GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCCGGTTA TTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTTTC CCTGGTTTAC GGCCAGATGA ACGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGCTGACTGG TCTGACTATG GCTGAGAAGT TCCGTGACGA 650 

45 AGGTCGCGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACAGA AGTATCTGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



50 



2 ) INFORMATION FOR SEQ ID NO: 32 0 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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CA 02307010 2000-05-19 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 0 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 100 

10 TCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT TCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG AC CTG ATGTG 3 50 

15 TCCGTTCGCG AAGGGC GGT A AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 400 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC C AC G AAATG A CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

20 CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

AC TGC TGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 7 00 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

CGGTTCTATC 810 

25 

2 ) INFORMATION FOR SEQ ID NO : 321 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: -834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus acidophilus 
40 (B) STRAIN: ATCC 4356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 321 

TCGATAAGAA TTTACC TGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50 

45 GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACTCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTACCGA TGGTCTTCGT CGTGGTATGA 150 

AAGTC G AAG A TACTGGCGCT C C AATTTCAG TTCCAGTTGG AGAAGACACT 200 

TTAGGTCGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTC CAGC 250 

CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 300 

50 ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 3 50 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 400 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 450 

TCGCTCAAGA AC AC GGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 500 

ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 550 

55 AAGTAAGACT GCCATGGTAT TTGGTCAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA C TTT AG AG AT 6 50 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 7 00 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 7 50 

TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT GCAGGAAAGA 800 

60 ATTACTTCTA CTAAGAAGGG TTC AATTACT TCAA 83 4 

395 
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2) INFORMATION FOR SEQ ID NO: 322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 

2 0 TTCCTCGTGA TAGCGTGCCT AAAGTCAATG ATGCGTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGGAA CTTGGAGACG GAGTTGTGCG 100 

T AC TATTGC C ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG CCATCCTATT CAAGTGCCAG TAGGTAAGAA AAC TTTGGG A 200 

CGCATTATGG ATGTTC TTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 250 

25 TGCTGAAGAG ACTTGGGCTA TTCATCGTAA AGCACCAAGT TATGAAGAGC 3 00 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 350 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 400 

TGTAGGCAAA ACCGTTAACA TGATGGAATT AATACGAAAC ATTG C AATTG 450 

AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TACCCGTGAA 500 

30 GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT TATGGTCAGA TGAATGAGCC GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGC TG AAA AATTCCGGGA TGAAGGGCGA 650 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

35 CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAGCGCAT TACCTCCACA 800 

AAAACAGGTT CTATTACTTC CATA 824 



40 2 ) INFORMATION FOR SEQ ID NO : 323 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grlmontii 

(B) STRAIN: ATCC 33999 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 50 

GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 100 

TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 150 

CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 200 

CATGAACGTG CTTGGTCACC CTATCGACAT GAAGGGCGAC ATCGGCGAAG 25 0 

396 
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AAGAGCGTTG GGCTATTCAC CGCGCAGCGC CGAGCTACGA AGACCTGTCG 300 

GGCGCAACCG AGCTGCTGGA GACCGGCATC AAGGTTATCG AC CTG ATTTG 350 

TCCGTTCGCC AAGGGCGGTA AAGTCGGCCT GTTCGGCGGC GCCGGCGTAG 400 

GTAAAACCGT AAACATGATG GAGCTCATTC GCAACATTGC G AC CGAGC AC 450 

5 TCCGGTTACT CCGTGTTTGC AGGCGTAGGT GAACGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGAATCCAA C GTATTGG AC AAGGTGTCGC 550 

TGGTAT AC GG TCAGATGAAC GAGCCGCCTG GAAACCGTCT GCGCGTAGCG 600 

TTAACGGGCT TGACCATGGC GGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTT ATCGACAACA TTTACCGCTA TACCCTGGCC GGTACGGAAG 7 00 

10 TATCCGCACT GCTGGGCCGT ATGCCTTCAG CCGTAGGCTA CCAGCCGACT 7 50 

CTGGCTGAGG AAATGGGCGT GCTTCAAGAG CGTATTACCT CTACCAAGAC 800 

GGGGTC TATC ACCTCCGT 818 



15 



25 



30 



2 ) INFORMATION FOR SEQ ID NO : 324 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 835 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 



GTGGAAACTT AGCTGAAATC TACAATGCCC TAGT-TATTGA ATATAAATCT 5 0 

GATGCAGAAG AAGCACCAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCCCAAT TACAGTTCCT 2 00 

3 5 GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 2 50 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTC AC C 3 00 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATTCTTGAA 350 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 400 

AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 450 

40 AGCTTATC C A TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 550 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 600 

AGCCACCAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA 7 00 

45 CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 7 50 

GT ATGC CATC TGCGGTAGGT TACCAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 



50 

2 ) INFORMATION FOR SEQ ID NO : 325 

( i ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

397 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus lylae 

(B) STRAIN: ATCC 27566 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 5 0 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 100 

GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150 

10 CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 200 

GTGACGTAGT CAAGGGACAC GTCTTCAACG CTCTGGGTGA TTGCCTCGAT 25 0 

GAGCCAGGTC TTGGC CGTGA CGGTGAGCAG TGGGGCATCC ACCGCGATCC 300 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 350 

TTAAGGTCAT CGACCTGCTG ACCCCGTATG TTAAGGGCGG CAAGATCGGC 400 

15 CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 450 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG 550 

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGCCAGATGG ACGAGCCTCC 600 

AGGAGTTCGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GC GGAGT ACT 65 0 

20 TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 7 00 

CGTTTCACCC AGGCAGGTTC CGAGGTxTCC ACCCTCCTAG GCGGC-ATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AG AC G AG ATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 82 8 



25 



40 



2 ) INFORMATION FOR SEQ ID NO: 326 



( i ) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 822 bases . . 

(B) TYPE: Nucleic -acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 

(B) STRAIN: ATCC 35017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGCCGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 100 

45 GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATCCGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 200 

TATCATGAAC GTGCTGGGTG ACCCAATCGA CATGAAAGGT GATATCGGCG 2 50 

AAGAAGAACG C TGGTC AATT CACCGCGCAG CACCAAGCTA TGAAGATCTG 3 00 

GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGACCTGAT 3 50 

50 TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 450 

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTC GTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 55 0 

CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600 

55 GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATACCTTA GCAGGGACAG 7 00 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 800 

GACCGGCTCT ATCACTTCCG TA 8 2 2 



60 



398 
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2 ) INFORMATION FOR SEQ ID NO: 327 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Branhamella catarrhalis 

(B) STRAIN: ATCC 43628 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 



CCGTGGCGAT 
2 0 AAACCACCCT 
ATTGCCATGG 
TTCAGGTGCA 
TTATGGATGT 
GAACAAAAAT 

2 5 AAATAGTACA 

GTCCATTTGC 
GGTAAGACCG 
CTCAGGTCTG 
ATG AC TTCT A 

3 0 TTTACTC AGT 

AGGAAACCGT 
TCCGTGATGA 
TTCGTTGATA 
AC TTTTAGGT 
3 5 AAGAGATGGG 
ATTA 



GTCCCCCAAA 
TGAAGTCCAA 
GTTCTACCGA 
CCCATTTCGG 
CCTAGGTCGC 
GGTCCATTCA 
GAACTTTTAG 
CAAAGGTGGT 
TTAACATGAT 
TCGGTTTTTG 
CCATGAAATG 
CAAAAGTTGC 
CTGCGTGTTG 
AAAAGACGAA 
ATATTTATCG 
CGTATGCCAT 
CTTGCTACAA 



TCTTTGATGC 
CAACAGTTAG 
AGGCTTAAAG 
TACCAGTCGG 
CCAATCGATG 
TCGTGAAGCA 
AAACAGGCAT 
AAAGTC GGTC 
GGAGCTTATC 
CTGGTGTGGG 
CAAGAAGCAG 
CATGGTTTAT 
CCTTAACTGG 
GCAACGGGCA 
TTACACATTG 
CTGCGGTAGG 
GAGCGTATCA 



ACTTCATGTT 
GTGATGGTGT 
CGTGGCTTGC 
TCAAGCAACA 
AAGCAGGTCC 
CCAAGTTATG 
CAAAGTGATT 
TGTTCGGTGG 
AATAATATCG 
TGAGCGTACT 
GCGTTGTTAA 
GGTCAGATGA 
TTTGACCATG 
AAGGCCGTGA 
GCAGGTACTG 
TTATCAGCCG 
CCTCCACCCA 



GATGGTACTG 50 

GGTGCGTACC 100 

CTGTCTCTAA 15 0 

CTGGGTCGCA 200 

GGTAAATGCT 250 

ATGAACAGTC 300 

GATTTGCTTT 350 

TGCTGGTGTT 400 

CCCTAAAACA 450 

CGTGAGGGTA 500 

TACCGAAGAT 550 

ATGAGCCACC 600 

GCAGAGTATT 650 

TGTTCTGCTG 7 00 

AGGTATCAGC 750 

AC TTTGGC C G 800 

ATCAGGCTCA 850 
854 



40 2 ) INFORMATION FOR SEQ ID NO: 328 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

55 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 150 

CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200 

6 0 TCATGGACGT TTTAGGACAC CCAATCGATG AGGCAGGTCC GGTAGAGCAT 250 

399 
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AGTAACACTT GGGCGATTCA CCGTGAAGCG 
AAACTCTACT GAACTTTTAG AAACCGGTAT 
GCCCATTTGC TAAAGGTGGT AAAGTCGGTC 
GGTAAAACCG TTAACATGAT GGAACTTATC 
5 CTCAGGTTTA TCGGTATTTG CTGGTGTAGG 
ATGACTTCTA CCACGAGATG AAAGACTCAA 
ATGGTGTATG GTCAGATGAA TGAGCCACCA 
CCTGACAGGT TTAACCATGG CAGAATACTT 
ACGGTAAAGG TCGTGACGTA TTATTGTTCG 
10 ACGCTAGCGG GTACCGAAGT ATCAGCATTA 
AGTAGGGTAT CAGCCAACGC TTGCAGAAGA 
GTATTACTTC AACCCAATCA GGCTCTATTA 



CCAAGCTATG ATGAACAATC 3 00 

TAAAGTAATT G AC TT AC TAT 3 50 

TGTTC GGTGG CGCGGGTGTT 400 

AATAACATCG CAAAAGCACA 450 

TGAGCGTACT CGTGAAGGTA 500 

AC GT AC TTG A TAAAGTTGCG 550 

GGAAACCGTT TACGTGTTGC 600 

CCGTGACGAA AAAGATGAAA 650 

TTGACAATAT TTATCGTTAC 700 

TTAGGTCGTA TGCCATCTGC 7 50 

GATGGGTGTA CTACAAGAAC 800 

C 831 



15 

2 ) INFORMATION FOR SEQ ID NO : 329 

{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 5 bases 
20 (B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morgan ii subsp. morgan ii 

(B) STRAIN: ATCC 25830 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 

CG^ATTTCCT CAGGATGCAG taccgaaagt GTACGATGCG CtTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 15 0 

35 TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 200 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTG ATCC GAT CGATATGAAA 250 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 3 00 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 350 

TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 4 00 

40 GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 450 

CATCGCGATC GAGCACTCCG GTTACTC TGT ATTCGCAGGG GTCGGTGAGC 500 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGGCCAG ATGAACGAGC CACCGGGAAA 600 

CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTC CGTG 650 

45 ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTAT AC C 7 00 

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 7 50 

AGGTTACCAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 83 5 

50 

2 ) INFORMATION FOR SEQ . ID NO: 33 0 

( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 

400 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Pantoea agglomerans 

(B) STRAIN : ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 



GACGCGGTAC 
GCGTCTGGTG 

10 CCATCGCAAT 
GACCTGCAGA 
TATCATGAAC 
AAGAAGATGG 
TCTTATGAAG 

15 GGTTATCGAC 
TCGGTGGTGC 
AACATCGCGG 
GCGTACTCGT 
TTATCGATAA 

20 AACCGTCTGC 
TGATGAAGGT 
CCCTGGCCGG 
GTAGGTTACC 
T ATT AC C TC C 

25 



CGCAAGTGTA 
CTGGAAGTTC 
GGGTACGTCT 
AACCGATTCA 
GTTCTCGGCG 
CAGCGCAGTA 
ATCAGTCTAA 
CTGATGTGTC 
GGGTGTAGGT 
CTGAACACTC 
GAGGGTAACG 
AGTTGCACTG 
GCGTAGCACT 
CGCGACGTTC 
TACAGAAGTT 
AGCCAACGCT 
AC C AAGACCG 



CAGCGCCCTC 
AGCAGCAGCT 
GACGGCCTGA 
GGTACCCGTC 
AGCCAATCGA 
GAGATCGCCT 
CTCGCAGGAA 
CGTTTGCTAA 
AAAAC CGTC A 
AGGTTACTCA 
ACTTCTACCA 
GTCTATGGCC 
GACCGGTCTG 
TGCTGTTCAT 
TCTGCACTGC 
GGCAGAAGAG 
GTTC 



GAGGTTATGA 
CGGCGGCGGC 
AGCGTGGTCT 
GGTAAAGCGA 
TATGAAAGGC 
CTATTCACCG 
CTGCTGGAAA 
AGGCGGTAAA 
ACATGATGGA 
GTGTTTGCCG 
CGAAATGACT 
AGATGAACGA 
ACCATGGCGG 
. CGATAAC ATC 
TGGGTCGTAT 
ATGGGTGTGT 



ATGGTGATGC 50 

GTAGTACGTA 100 

GAG CGTC AAC 150 

CCCTGGGCCG 2 00 

GAGCTGAAAG 2 50 

CGCAGCCCCT 300 

CCGGCATCAA 350 

GTCGGTCTGT 400 

ACTGATCCGT 450 

GTGTGGGTGA 500 

GACTCTAACG 550 

GCCGCCGGGT 600 

AAAAATTCCG 65 0 

TACCGTTATA 7 00 

GCCATCTGCG 7 50 

TGCAGGAGCG 800 
824 



2 ) INFORMATION FOR SEQ ID NO : 3 31 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pantoea dispersa 
40 (B) STRAIN: ATCC 14589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 331 

TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50 

4 5 GTACAGCAGC AGC TGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 200 

GGTGAGCCAA TCGACATGAA AGGCGACCTG AAAGAAGAAG ACGGCAGCGC 250 

TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 300 

50 CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 350 

TGTCCGTTCG CGAAGGGCGG TAAAGTC GGT CTGTTCGGTG GTGCGGGTGT 400 

AGGT AAAAC C GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 450 

ACTCAGGTTA TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550 

55 GCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 600 

CACTGACCGG TCTGACCATG GCGGAAAAAT TCCGTGATGA AGGCCGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 7 00 

GGTTTCTGCA CTGCTGGGTC GTATGCCATC GGC GGT AGGT T ATC AGC C AA 750 

CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800 

60 ACCGGTTC 808 

401 
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2) INFORMATION FOR SEQ ID NO : 332 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 332 

20 GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 5 0 

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 100 

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 15 0 

AACCCAATTT CTGTTCCAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 200 

CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GC AG AAG AG A 250 

25 ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA ATCTAACAGT 3 00 

AC TG AACTTT TAG AAAC GGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 350 

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 400 

CCGTCAATAT GATGGAATTA ATCCGTAACA TCGCAATTGA GCACTCAGGT 450 

TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 500 

30 CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550 

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 600 

GGCTTAACTA TGGCGGAAAA ATTC CGTG AT GAAGGTCGTG ATGTCTTATT 650 

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA G AAGTTTC TG 7 00 

CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 750 

35 GAAGAAATGG GTGTTCTGCA AGAGCGTATT ACCTCAACCA AAACAGGTTC 800 

TATTA 805 



40 2 ) INFORMATION FOR SEQ ID NO: 3 33 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 

(B) STRAIN: ATCC 49100 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 



TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 200 

60 GGGCCGTATT ATGAACGTGC TCGGTCAGCC AATCGATATG AAAGGCGATA 250 

402 
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TTGGTGAAGA AGAGCGTTGG GCTATTCACC 
GATTTATCTG GCGCCAATGA ACTGCTGGAA 
CCTGATTTGT CCGTTTGCTA AAGGTGGTAA 
CGGGTGTAGG TAAAACCGTA AACATGATGG 
5 AC TG AGC ACT CAGGTTACTC CGTATTCGCC 
TGAAGGTAAT GACTTCTACC ACGAAATGAC 
AAGTATCTCT GGTTTATGGC CAGATGAACG 
CGCGTGGCGT TAACGGGTTT GACCATGGCT 
TCGTGACGTT C TGTTATTT A TC G AT AAC AT 
10 GTACCGAAGT ATCAGCACTG TTGGGGCGTA 
CAGCCAACGT TAGCAGAAGA GATGGGTGTG 
AAC C AAAAC C GGTTCAATCA CTTCTGTA 



GTGAAGCACC AAGTTATGAA 3 00 

ACGGGTATCA AGGTTATCGA 3 50 

AGTTGGTCTG TTTGGTGGTG 400 

AGCTGATTCG TAACATTGCG 45 0 

GGTGTAGGGG AACGTACCCG 500 

CGAATCAAAC GTACTGGATA 550 

AGCCACCAGG AAACCGTCTG 600 

GAAAAATTCC GTGATGAAGG 650 

TTATCGCTAT ACCTTAGCCG 7 00 

TGC CATC AGC GGTAGGTTAT 750 

TTGCAGGAAC GTATTACTTC 800 

828 



20 



25 



15 

2 ) INFORMATION FOR SEQ ID NO : 3 34 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( ii ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 

GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50 

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 100 

CAATGGGTAC ATCAGACGGT TTAAGCCGTG GCTTAAAGGT TGAAGATTTA 15 0 

35 GGCCACCCAA TTGAAGTACC AGTAGGTAAA GCGACTTTAG GACGTATCAT 2 00 

GAACGTTCTG GGTACACCTA TTGATATGAA AGGTGAGATT GAAACCGAAG 25 0 

AGCGTTGGTC AATCCACCGT GAAGCACCAA CTTACGAAGA GTTATCAAAC 3 00 

TCTCAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 3 50 

ATTTGC TAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 400 

40 AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACTCA 45 0 

GGTTACTCTG T ATTTGC TGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500 

C.TTCTATC AT GAAATGACAG ATTCTAACGT TC TTG AC AAA GTATCGTTAG 550 

TTTACGGTCA GATGAATGAG CCACCAGGAA ACCGTCTGCG TGTTGCACTG 600 

ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650 

45 GTTATTCGTC GATAACATCT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 7 00 

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTAC C A ACCAACATTG 750 

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA C C AAAAC AGG 800 
TTCTATC 



50 



2 ) INFORMATION FOR SEQ ID NO : 335 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 811 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

403 



807 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Proteus vulgaris 

(B) STRAIN: ATCC 13315 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 33 5 

CCCTAAAGTA TACGACGCTC TTGAGGTTAT GAATGGTAAA GAGAAACTGG 50 

TGCTAGAAGT TCAGCAACAG TTAGGCGGTG GTATCGTTCG TTGTATCGCA 100 

10 ATGGGTACAT CAGACGGTTT AAGCCGTGGC TTAAAAGTTG AAAAC TTAGG 15 0 

CCACCCAATT GAAGTACCAG TAGGTAAAGC AAC AC TGGGA CGTATCATGA 2 00 

ACGTTCTGGG TACACCTATC GATATGAAAG GTGATATTGC AACTGAAGAA 2 50 

CGTTGGTCTA TTCACCGCGA AGCGCCAACC TATGAAGAGT TATCAAGCTC 300 

TCAAGAACTA CTAGAAACCG GTATCAAAGT AATGGACTTA ATCTGTCCGT 35 0 

15 TTGCTAAAGG TGGTAAAGTA GGTCTCTTCG GTGGTGCGGG TGTTGGTAAA 4 00 

ACAGTTAACA TGATGGAATT GATCCGTAAC ATCGCGATTG AGCACTCAGG 450 

TTATTC TGT A TTTGCAGGTG TTGGTGAGCG T AC TCGTG AG GGTAACGACT 500 

TCTATCATGA AATGACAGAT TCTAACGTTC TTGACAAAGT ATCGTTAGTT 550 

TATGGTCAGA TGAATGAGCC ACCAGGAAAC CGTCTACGTG TAGCACTGAC 600 

2 0 GGGTTT AAC C ATGGCGGAAA AATTCCGTGA TGAAGGCCGT GACGTACTGT 650 

m^rprronrrcGA TAACATCTAT CGTTACACCT TAGCCGGTAC CGAAGTATCA 7 00 

GC AC TGTT AG GCCGTATGCC ATCAGCAGTA GGTTAC C AAC CAACATTGGC 7 50 

TGAAGAGATG GGTGTTC TGC AAGAACGTAT CACTTCAACC AAAAC AGGTT 800 

CAATCACCTC T 811 

25 



2) INFORMATION FOR SEQ ID NO: 336 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 
~(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia alcalifaciens 
40 (B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 

TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

45 AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 200 

GACGTATCAT GAACGTTCTG GGTG AAC C AA TCGACATGAA AGGTGATATC 250 

GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 3 00 

50 ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 350 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTC TGTT CGGTGGTGCG 400 

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 50 0 

AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 55 0 

55 GTATC AC TGG TTTATGGC C A GATGAACGAG CCACCAGGAA ACCGTCTGCG 600 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650 

GTGACGTACT GC TGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 7 00 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA 7 50 

ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 800 

60 CTCAAA 

404 



806 
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2 ) INFORMATION FOR SEQ ID NO: 3 37 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 

2 0 TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 100 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 150 

GAAGTTGTAA AC TT AGAGC A CCCAATTGAA GTACCAGTAG GTAAAGCAAC 200 

TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 250 

2 5 ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 300 

GAAGAGTTAG CTAACTCAAC AGAGC TGCTG GAAACCGGTA TCAAAGTAAT 350 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 450 

GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 500 

30 TCGTGAAGGG AACGACTTCT ATCATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGCCAGATGA ATGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGTTGACTGG TCTG AC TATG GCTGAAAAAT TCCGTGACGA 650 

AGGTC GTG AC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 7 50 

3 5 TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 83 0 



40 2 ) INFORMATION FOR SEQ ID NO: 338 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 812 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 



AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

TTAGAACACC CAATTGAAGT ACCAGTAGGT AAAGCAACTC TGGGACGTAT 200 

60 CATGAACGTT CTGGGTGACC C TATTG AT AT GAAAGGTGAT ATCGGCGAAG 2 50 

405 
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AAGAGCGCTG GTCTATTCAC CGTTCAGCGC 
AACTCAACAG AAC TGC TAG A AACCGGTATC 
TCCATTCGCG AAAGGTGGTA AAGTTGGTCT 
GTAAAACAGT AAACATGATG GAACTGATCC 
5 TCAGGTTACT CAGTATTCGC TGGTGTTGGT 
CGACTTCTAT CATGAAATGA CTGATTCTAA 
TGGTTTATGG CCAGATGAAC GAGCCACCAG 
CTGACTGGTC TGACTATGGC TGAAAAATTC 
ACTGCTGTTC GTTGACAACA TTTATCGTTA 
10 TATCAGCACT GTTAGGTCGT ATGCCTTCAG 
TTGGCAGAAG AGATGGGTGT TCTACAAGAA 
CGGTTCTATC AC 



CAAGCTATGA AGAATTAGCT 3 00 

AAAGTAATGG ACTTAATCTG 3 50 

GTTCGGTGGT GCGGGTGTTG 400 

GTAACATCGC GATTGAGCAC 450 

GAGCGTACCC GTGAAGGTAA 500 

CGTTCTGGAT AAAGTATCAC 55 0 

GAAACCGTCT GCGTGTTGCG 600 

CGTGACGAAG GTCGTGACGT 650 

TACACTGGCA GGTACTGAAG 700 

CGGTAGGTTA TCAGCCAACA 750 

CGTATCACTT CTACCAAAAC 800 

812 



15 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 339 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 
(D j TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Providencia stuartu 

(B) STRAIN: ATCC 33672 

30 (xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 339 

TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

35 TAAAAATTTA GAACACCCAA TTG AAGTAC C AGTAGGTAAA GCAACACTCG 200 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 250 

GGCGAAGAAG AGCGTTGGTC TATTCACCGC GCTGCACCAA GCTACGAAGA 300 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 3 50 

TGATCTGTCC ATTC GC G AAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 400 

40 GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTC TATC AT GAAATGACAG ATTCAAACGT TC TTG AC AAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

45 GTGATGTTCT GTTGTTCGTG GATAACATCT ATC GTT AT AC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 7 50 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 800 

CTAAGACAGG TTCTATCAC 819 



50 



2 ) INFORMATION FOR SEQ ID NO: 340 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

406 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 

AACCGCACTG ACGTGCCTCA AATTTATGAC GCGTTAGTTG TAGATGGTAC 50 

CGAAACCACC CTAGAAGTTC AGCAGCAGCT GGGCGATGGT GTGGTACGTA 100 

10 CTATTGCAAT GGGATCTACT GAAGGTCTTA AGCGTGGTTT ACCAGTAACA 150 

AACACTGGTG CCCCAATTAC AGTTCCAGTA GGTGATGCGA CTTTAGGTCG 2 00 

TATTATGGAC GTTTTAGGTC GTCCAATTGA CGAACAAGGT CCAGTTAATT 2 50 

CTGAAGACCA TTGGTCAATC CACCGTCAAG CGCCATCATA TGATGAGCAA 300 

GCTAACAGTA CTGACCTATT AGAGACAGGT ATTAAAGTAA TTGACTTACT 350 

15 TTGTCCGTTC GCTAAAGGGG GTAAAGTTGG TCTGTTCGGT GGTGCCGGTG 400 

TTGGTAAAAC CGTAAACATG ATGGAATTGA TTAATAACAT CGCTCTTAAG 450 

CACTCAGGTT TATCAGTATT CGCTGGTGTG GGTGAGCGTA CTCGTGAAGG 500 

TAACGACTTC TACCACGAGA TGCAAGAAGC GGGTGTTGTT GACGTTGAAA 550 

ACTTCACCAA CTCAAAAGTT GCGATGGTTT ATGGTCAGAT GAATGAGCCA 600 

20 CCAGGTAACC GTTTACGTGT TGCGTTAACC GGTCTGACTA TGGCTGAGTA 650 

CTTCCGTGAT CAAAAAGATG AAAACGGTAA AGGTAAAGAC GTTCTATTAT 7 00 

TCGTTGATAA CATCTACCGC TACACGCTAG CCGGTACTGA AGTATC AGC A 75 0 

CTTCTAGGTC GTATGCCATC AG C AGTAGGT TATCAGCCAA CACTAGCGGA 800 

AGAGATGGGT GTACTACAAG AGCGTATTAC TTCAACTCAG ACTGGTTCTA 850 

25 TTACTTC 90 



30 



2 ) INFORMATION FOR SEQ ID NO : 341 

( i ) SEQUENCE CHARAC TERISTIC S: - 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Rahnella aquatllls 

(B) STRAIN: DSM 4594 

{xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 341 

45 GTTCCCTCAG GATGCAGTAC CGAACGTGTA CAATGCTCTT GAGGTAGAAA 5 0 

ACGGTACCTC CAAACTGGTG CTGGAAGTTC AGCAACAGTT AGGCGGCGGC 100 

GTTGTTCGTT GTATCGCAAT GGGTACCTCA GACGGCCTGC GTCGCGGTCT 150 

GAAAGTGAAC AACCTGGAAC ACCCAATTGA AGTACCGGTA GGTAAAGCGA 200 

CTCTGGGTCG TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAAGGT 250 

50 GAAATCGGCG AAGAAGAACG TCGTGCTATT CACCGTGCTG CGCCTTCTTA 300 

TGAAGAGCTG GCAAACTCCC AGGAATTGCT GGAAACCGGT ATCAAAGTTA 3 50 

TGG AC CTG AT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 400 

GGTGCGGGTG TAGGTAAAAC TGTGAACATG ATGGAGCTGA TC CGTAAC AT 450 

TGCGATC GAG CACTCCGGTT ATTC TGTGTT TGCAGGCGTG GGTGAACGTA 500 

55 CTCGTGAGGG TAACGACTTC TACCACGAAA TG AC TGATTC CAACGTTATC 550 

GACAAAGTTT CCCTGGTGTA TGGCCAGATG AATGAGCCAC CAGGTAACCG 600 

TCTGCGCGTT GCACTGACCG GCCTGACCAT GGCGGAAAAA TTCCGTGATG 650 

AAGGTCGTGA CGTACTGCTG TTCGTTGACA ACATTTACCG TTAC AC CCTG 7 00 

GCCGGTACCG AAGTGTCCGC AC TTCTGGGC CGTATGCCAT CGGCGGTAGG 7 50 

60 TTATC AGC C A ACGCTGGCGG AAGAGATGGG CGCTCTGCAA GAACGTATCA 800 

407 
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CCTCGACCAA AAGTGGTTCT ATCACCTCCG TA 832 



5 2 ) INFORMATION FOR SEQ ID NO: 342 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. an zonae 

(B) STRAIN: ATCC 13314 



20 



40 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 15 0 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

25 GCTGGGTCGT ATCATGAACG TCCTGGGCGA AC CGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 45 0 

30 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

^TCGTGAGGGT AACGACTTCT _ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 65 0 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TAT AC CCTC G 7 00 

35 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYCASCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 



2 ) INFORMATION FOR SEQ ID NO: 343 



( i ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 

(B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50 
GAAGCTGGTG CTGGAAGTTC AGCAGCAGCT TGGCGGCGGT ATCGTGCGTA 100 
CCATCGCGAT GGGGTCTTCT GACGGTCTGC GTCGCGGTCT GGATGTAAAA 150 

408 
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GATCTCGAAC ACCCGATCGA AGTCCCGGTA 
TATCATGAAC GTCCTGGGCG AACCGGTCGA 
AAGAAGAGCG TTGGGCGATT CACCGCGCAG 
TCAAACTCTC AGGAACTGCT GGAAACCGGT 
5 GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG 
TAGGTAAAAC CGTAAACATG ATGGAGCTTA 
CACTCCGGTT ACTCAGTGTT TGCGGGCGTA 
TAACGACTTC TACCACGAAA TGACCGACTC 
CCCTGGTGTA TGGCCAGATG AACGAGCCGC 
10 GCACTGACCG GCCTGACCAT GGCGGAGAAA 
TGTACTGCTG TTCGTCGATA ACATCTATCG 
AAGTATCCGC ACTGC TGGGC CGTATGCCTT 
ACTCTGGCGG AAGAGATGGG CGTTCTGCAG 
AACCGGTTCT ATCACCTCCG 

15 



GGTAAAGCCA CGCTGGGTCG 200 

CATGAAAGGC GAGATCGGCG 250 

CACCTTCCTA CGAAGAGTTG 300 

ATCAAAGTTA TCGACCTGAT 3 50 

TCTGTTCGGT GGTGCGGGTG 4 00 

TTCGTAACAT CGCGATCGAG 450 

GGGGAACGTA CTCGTGAGGG 500 

CAACGTTATC GATAAAGTAT 550 

CGGGAAACCG TCTGCGCGTT 600 

TTCCGTGACG AAGGTCGTGA 650 

TTACACCCTG GCCGGTACGG 7 00 

CCGCAGTAGG TTACCAGCCG 750 

GAACGTATCA CCTCCACCAA 800 

820 



2 ) INFORMATION FOR SEQ ID NO : 344 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: , 

(A) ORGANISM: Salmonella choleraesuis subsp. diarxzonae 

30 (B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

TTCCCTCAGG ATGC CGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

35 TGGTAATGAG AAGC TGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 300 

40 GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

45 ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 7 00 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 800 

50 CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



55 



2 ) INFORMATION FOR SEQ ID NO: 345 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

409 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: , , , 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 345 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCOTACCTAC CATCGC G ATG GGGTCTTCTG AC GGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATC TCGAAC A CCCGATCGAA GTCCCGGTAG- GTAAAGCTAC 200 

GCTGGGTCGT ATC ATG AAC G TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 300 

Sa^gagttgt CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC AC TC CGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT AC C ACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

Jatcagccga CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAG AC CGGTTC TA TCACCTCCGT A 83 ! 



2 ) INFORMATION FOR SEQ ID NO: 346 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . . 

(A) ORGANISM: Salmonella choleraesuis subsp. mdica 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TC GTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTC CGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 7 00 

CCGGTACGGA AGTTTCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 7 50 

410 
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10 



15 



20 



TATCAGCCAA CTCTGGCGGA AG AG ATGGG C GTTCTGCAGG AACGTATCAC 800 
CTCCACCAAA ACCGGTTCTA TCACCTCCG 829 



2 ) INFORMATION FOR SEQ ID NO: 347 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: m 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

(B) STRAIN: ATCC 915 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 



GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

2 5 TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 200 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

3 0 TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT .GTTCGGTGGT GC GGGTGT AG 400 

GTAAAACCGT-AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

C G ACTTC T AC CATGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

3 5 CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

AC TGC TGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 7 00 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 7 50 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

CGGTTCTATC ACCTCCG 817 

40 

2 ) INFORMATION FOR SEQ ID NO: 348 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
55 serotype Paratyphi B 

(B) STRAIN: ATCC 8759 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 
GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 



411 
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GCTGGTGCTG GAAGTTCAGC 
TCGCGATGGG GTCTTCTGAC 
CTCGAACACC CGATCGAAGT 
CATGAACGTC CTGGGCGAAC 
5 AAGAGCGTTG GGCGATTCAC 
AACTCTCAGG AACTGCTGGA 
TCCGTTCGCG AAGGGCGGTA 
GTAAAACCGT AAACATGATG 
TC CGGTT ACT CTGTGTTTGC 

10 CGACTTCTAC CACGAAATGA 
TGGTGTATGG CCAGATGAAC 
TTGAC CGGTC TGACCATGGC 
ACTGCTGTTC GTCGATAACA 
TATCCGCACT GCTGGGCCGT 

15 CTGGCGGAAG AGATGGGCGT 
CGGTTC 



AGCAGCTTGG CGGCGGTATT 
GGTCTGCGTC GCGGTCTGGA 
CCCGGTAGGT AAAGCCACGC 
CGGTCGACAT GAAAGGCGAG 
CGCGCAGCGC CTTCCTACGA 
AAC CGGTATC AAAGTTATCG 
AAGTCGGTCT GTTCGGTGGT 
GAGCTTATCC GTAACATCGC 
GGGCGTAGGT G AAC GT AC TC 
CCGACTCCAA CGTTATCGAT 
GAGCCGCCGG GAAACCGTCT 
GGAGAAATTC CGTGACGAAG 
TCTATC GTT A CACCCTGGCC 
ATGCCTTCCG CAGTAGGTTA 
TCTGCAGGAA CGTATCACCT 



GTGCGTACCA 


100 


TGTAAAAGAT 


150 


TGGGTCGTAT 


200 


ATCGGCGAAG 


250 


AGAGTTGTCA 


300 


ACCTGATGTG 


350 


GCGGGTGTGG 


400 


GATCGAGCAC 


450 


GTGAGGGTAA 


500 


AAAGTATCCC 


550 


GCGCGTTGCA 


600 


GTCGTGAC GT 


650 


GGTACGGAAG 


700 


CCAGCCGACT 


750 


CCACCAAAAC 


800 




806 



20 2 ) INFORMATION FOR SEQ ID NO : 349 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 ( vi ) ORIGINAL SOURCE : . 

-(-A) -ORGANISM: Salmonella chol era esui s subsp . salamae 



35 



(B) STRAIN: ATCC 43972 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 5 0 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 15 0 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

4 0 GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

"~ AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TC AAAGTT AT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

45 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

*~ TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

50 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 75 0 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACTAAA AC CGGTTC TA TCACCTCCGT A 83 1 



2 ) INFORMATION FOR SEQ ID NO : 350 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 823 bases 
60 (B) TYPE: Nucleic acid 

412 
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10 



40 



(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhi 

(B) STRAIN: ATCC 10749 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 350 



TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

15 TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATG AAAGGC G 2 50 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 3 00 

GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAAC CGGTA TCAAAGTTAT 3 50 

2 0 CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAAC C GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCATGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

2 5 CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 6 50 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 7 00 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 7 50 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAG ACCGGTTCTA TCA 82 3 

30 

2) INFORMATION FOR SEQ ID NO: 351 

35 { i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
45 serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 

50 TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA. 5 0 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TTGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

55 AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAAC C GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCAGTGTTT GCGGGCGTAG GGGAACGTAC 500 

60 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

413 
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ATAAAGTATC CCTGGTGTAT GGCCAGATGA 
CTGCGCGTTG CATTGACCGG TCTGACCATG 
AGGTCGTGAC GTACTGCTGT TCGTCGATAA 
CCGGTACGGA AGTATCCGCA CTGCTGGGCC 
5 TACCAGCCGA CTCTGGCGGA AGAGATGGGC 
CTCCACCAAA ACCGGTTCTA TCACCTCCGT 



ACGAGCCGCC GGGAAACCGT 600 

GCGGAGAAAT TCCGTGACGA 650 

CATCTATCGT TACACCCTGG 7 00 

GTATGCCTTC CGCAGTAGGT 750 

GTTCTGCAGG AACGT ATC AC 800 

a 831 



10 2) INFORMATION FOR SEQ ID NO: 352 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
!5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: , , _ 

(A) ORGANISM: Sferlmanel 1 a chcleraesuJLS subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

3 0 CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

1 GATGAAGGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 40 0 

3 5 GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 45 0 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 5 00 

CGACTTCTAC C AC G AAATG A CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

40 AC TGC TGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 7 00 

T ATCC GC ACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC 81 u 



45 



2 ) INFORMATION FOR SEQ ID NO: 3 53 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia flcaria 

(B) STRAIN: ATCC 33105 



60 



414 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 53 



GATGCCGTAC 
GAAACTGGTG 
5 GTATCGCAAT 
AACCTGGAAC 
TATCATGAAC 
AAGAAGAGCG 
TCCAACTCCC 

10 TTGTC CGTTC 
TGGGCAAAAC 
CACTCCGGTT 
TAACGACTTC 
CCCTGGTTTA 

15 GCATTGACCG 
CGTTCTGCTG 
AAGTGTCCGC 
ACGCTGGCGG 
GACCGGTTCC 

20 



CAAAAGTGTA 
CTGGAAGTTC 
GGGGACCTCT 
ACCCGATTGA 
GTATTGGGCG 
TTGGGCGATT 
AGGACCTGCT 
GCCAAGGGCG 
CGTAAACATG 
ATTCCGTGTT 
T AC C ACG AAA 
CGGCCAGATG 
GCCTGACCAT 
TTCGTTGACA 
ACTTCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



CGATGCCCTT 
AGCAGCAGCT 
GACGGTCTGC 
AGTGCCGGTG 
AACCAATCGA 
CACCGTCCTG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGAACGACTC 
AACGAGCCGC 
GGCGGAGAAA 
ACATTTACCG 
CGTATGCCAT 
CGTTC TGCAA 



GAGGTAGAAA 
GGGCGGTGGC 
GTCGCGGTCT 
GGTAAAGCTA 
CATGAAAGGC 
CGCCAAGCTA 
ATCAAGGTAA 
TCTGTTCGGT 
TCCGTAACAT 
GGCGAGCGTA 
CAACGTTCTG 
CGGGTAACCG 
TTCCGTGACG 
TTACACCCTG 
CCGCGGTAGG 
GAACGTATCA 



ACGGTACCGA 50 

GTCGTTCGCT 100 

GAAAGTGAAC 150 

CCCTGGGCCG 200 

GAGATCGGCG 250 

CGAAGAGCTG 3 00 

TGGACCTGAT 350 

GGTGCGGGCG 400 

CGCGATCGAG 450 

CTCGTGAGGG 500 

GACAAAGTAT 55 0 

TCTGCGCGTT 600 

AAGGCCGCGA 650 

GCCGGTACCG 700 

TTATCAGCCA 7 50 

CCTCGACCAA 800 
820 



2 ) INFORMATION FOR SEQ ID NO: 354 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

_ „(ii)_MOLECULE:TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fonticola 
35 (B) STRAIN: ATCC 29844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 

GATGCCGTAC CGAAAGTGTA CAACGCTCTT GAGGTTGCAA ACGGCACCGA 50 

40 GAAATTGGTG CTGGAAGTTC AGCAACAGCT GGGTGGCGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCA GACGGTCTGC GTCGTGGTCT GGCCGTAACC 15 0 

GACCTGCAGC ACCCAATTGA AGT AC CGGTA GGTAAAGCTA CCCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GACATCGGCG 2 50 

AAGAAGAACG TTGGGCTATT CACCGCCCTG CGCCAAGCTA CGAAGAGCTG 3 00 

45 TCCAGCTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTTGG TCTGTTCGGT GGTGCTGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTC TGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTACTG GACAAAGTTT 55 0 

50 CCCTGGTTTA CGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTC GTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

55 GACTGGTTCA ATCACC 816 



2 ) INFORMATION FOR SEQ ID NO: 355 

60 

415 
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{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Serratia grimesii 

(B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 

15 GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 50 

TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GATGGTC TGC GTCGCGGTCT GAAAGTCACA 150 

GACCTGGACC ACC CAATTGA AGTACCGGTA GGTAAAGCTA CTCTGGGCCG 200 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GATATCGGCG 25 0 

2 0 AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 300 

QCC AACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 350 

CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTC TGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

25 TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCTCTGACCG GTCTG AC CAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 700 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 750 

30 ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACCTCCG TA 822 



35 2 } INFORMATION FOR SEQ ID NO: 356 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 



ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 50 

AAGCTGGTGC TGGAAGTTCA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 100 

TATCGCGATG GGGACCTCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 15 0 

ATCTGGAACA C CC AATTG AA GTACCGGTAG GTAAAGCTAC CCTGGGCCGT 200 

55 ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 300 

CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGACCTGATT 3 50 

TGTC CGTTCG ' CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGC TGGTGT 400 

TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 45 0 

60 ACTCCGGTTA TTCCGTGTTT GCAGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

416 
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AACGACTTCT ACCACGAAAT 
CCTGGTTTAC GGCCAGATGA 
CTCTGACCGG TCTGACCATG 
GTTCTGCTGT TCGTTGATAA 
5 AGTGTCCGCA CTTCTGGGCC 
CGCTGGCGGA AGAGATGGGC 
ACCGGTTCTA TCACTTCCG 



GAACGACTCC AACGTACTGG 
ACGAGCCACC GGGTAACCGT 
GCGGAGAAAT TCCGTGACGA 
CATTTACCGT TATACCCTGG 
GTATGCCATC TGCGGTAGGT 
GTTCTGCAAG AACGTATCAC 



ACAAAGTATC 550 

CTGCGCGTTG 600 

AGGCCGCGAC 650 

CCGGTACCGA 7 00 

TATCAGCCAA 750 

CTCTACCAAG 800 
819 



10 



20 



45 



60 



2 ) INFORMATION FOR SEQ ID NO: 357 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 805 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

{ vi ) ORIGINAL SOURCE : . 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 

2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 357 

TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150 

3 0 GAACAACCTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 200 

GTCGTATCAT G AAC GTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 250 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 300 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 350 

TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 400 

35 GGCGTAGGTA AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 45 0 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 500 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 55 0 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

CGTTGCGCTG AC C GGTCTG A CCATGGCGGA GAAATTCCGT GACGAAGGCC 650 

40 GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 7 50 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 800 

CCAAG 805 



2 ) INFORMATION FOR SEQ ID NO: 358 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odorlfera 

(B) STRAIN: ATCC 33077 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 



GATGCCGTAC 
AACTCTGGTG 
5 GTATCGC TAT 
GATCTGCAAC 
TATCATGAAC 
AAGAAGAGCG 
TCCAACTCCC 

10 CTGCCCGTTT 
TTGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 
CCCTGGTTTA 

15 GCGCTGACCG 
CGTTCTGCTG 
AAGTGTCTGC 
ACGCTGGCGG 
G AC CGGTTC T 

20 



CAAAAGTGTA 
CTGGAAGTTC 
GGGCACCTCC 
ACCCAATCGA 
GTATTGGGTC 
TTGGGCGATT 
AGGATCTGCT 
GCCAAGGGTG 
CGTAAACATG 
ATTCAGTGTT 
T ACC AC G AAA 
CGGCCAGATG 
GTCTGACCAT 
TTCGTTGACA 
ACTTCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



CGATGCCCTT 
AGCAGCAGCT 
GACGGTTTGC 
AGTCCCGGTT 
AACCAATCGA 
CACCGCGCGG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGACCGACTC 
AACGAGCCAC 
GGCCGAGAAA 
ACATTTACCG 
CGTATGCCAT 
CGTTCTGCAA 
TA 



GAGGTAGAAA 
GGGCGGTGGC 
GTCGCGGCCT 
GGCAAGGCAA 
CATGAAAGGC 
CGCCAAGCTA 
ATCAAGGTTA 
TCTGTTCGGT 
TCCGTAACAT 
GGTGAGCGTA 
CAACGTACTG 
CGGGTAACCG 
TTCCGTGACG 
TTACACCCTG 
CGGCGGTAGG 
GAACGTATCA 



ACGGTACCGA 50 

GTGGTTCGTT 100 

GAAAGTGAAC 150 

CGCTGGGTCG 200 

GACATCGGCG 250 

CGAAGAACTG 300 

TGGACCTGAT 350 

GGTGC GGGTG 400 

CGCGATCGAG 450 

CTCGTGAGGG 50 0 

GACAAGGTTT 550 

TCTGCGCGTT 600 

AAGGTCGTGA 650 

GCCGGTACCG 7 00 

TTATCAGCCA 750 

CCTCGACCAA 800 
822 



2 ) INFORMATION FOR SEQ ID NO: 3 59 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

30 

__ r _ (-i-i^) MOLECULE— TYPE : - Genomic_DNA — 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthlca 
35 (B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

GTGTACAACG CTCTTGAGGT AG AAAAC GGT GCCAATAAGC TGGTGCTGGA 50 

40 AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGCCGTATCA TGAACGTATT 200 

GGGTGAACCA ATCGACATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 3 00 

45 CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 350 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 400 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 450 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 500 

CGAAATGAAC GACTCCAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

50 AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

ACCATGGCGG AG AAATTC CG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 650 

TGATAACATC TACCGTTATA CCCTGGCCGG T AC CGAAGTG TCCGCACTTC 700 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 750 

ATGGGCGTTC TGCAAGAACG TATCACCTCT ACCAAGACCG GTTCTATCAC 800 

55 CTCCG 805 



2 ) INFORMATION FOR SEQ ID NO : 3 60 

60 

418 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 

15 TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50 

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100 

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 150 

AAAGTTAACG AC CTCGAGC A CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200 

GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250 

2 0 AC ATC GGCG A AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCAAGCTAC 3 00 

GAAGAGCTGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 3 50 

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 400 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500 

25 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550 

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTACCGT TACACCCTGG 7 00 

CCGGTACCGA AGTGTCCGCA CTGCTCGGCC GTATGC CATC TGCGGTAGGT 7 50 

30 TATCAGCCAA CGCTGGCGGA AGAGATGGGC GTTCTG CAAG AACGTATCAC 800 

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831 



3 5 2 ) INFORMATION FOR SEQ ID NO : 3 61 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 406 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(B) STRAIN: LCDC D7172 



50 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 61 



CCGTAAACAT GATGGAACTG ATC C GTAAC A TCGCCATCGA GCACAGCGGT 50 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

C TAC C AC GAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 150 

ACGGTCAGAT GAACGAGCCA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 200 

55 GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 250 

GTTCGTCGAC AACATCTATC GTTACACCCT GGCCGGTACC GAAGTATCCG 300 

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 3 50 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 400 

GATCAC 406 
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10 



2 ) INFORMATION FOR SEQ ID NO: 362 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 
!5 (B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 62 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

2 0 TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

TCTGGGCCGT ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 300 

25 GAAGAGCTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

3 0 ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTGZCTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 65 0 

AGGTC GTG AC GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG 7 00 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 7 50 

TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGT ATC AC 800 

35 CTCCACCAAA AC TGGTTCTA TCACCTCCGT A 83 1 



2) INFORMATION FOR SEQ ID NO: 3 63 



40 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 1183 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 63 

55 GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 15 0 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGC CGT AT 2 00 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 

60 AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTACGA AG AGC TGTC A 300 
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AACTCTCAGG AACTGCTGGA AACCGGTATC 
TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT 
GTAAAACCGT AAACATGATG GAGCTCATTC 
TCCGGTTACT CTGTGTTTGC GGGCGTAGGT 
5 CGACTTCTAC CACGAAATGA CCGACTCCAA 
TGGTGTATGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGTC TGACCATGGC TGAGAAATTC 
TCTGCTGTTC GTTGACAACA TCTATCGTTA 
TATCCGCACT GCTGGGCCGT ATGCCTTCAG 
10 CTGGCGGAAG AG ATGGGC GT TCTGCAGGAA 
CG 



AAAGTTATCG ACCTGATGTG 350 

GTTCGGTGGT GCGGGTGTAG 400 

GTAACATCGC GATCGAGCAC 450 

GAACGTACTC GTGAGGGTAA 5 00 

CGTTATCGAC AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCT 600 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACGGAAG 7 00 

CGGTAGGTTA TCAGCCGACC 7 50 

CGTATCACCT CCACCAAAAC 800 

802 



15 2 ) INFORMATION FOR SEQ ID NO : 3 64 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12022 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 



ATGC C GTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC ~TGGAAGTTC A " GCAGCAGCTC GGCGGCGGTA~TCGTGCGTAC 100 

CATC GC AATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGC G AC TCTGGGCCGT 200 

35 ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

40 ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

45 AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AAC GTATC AC CTCCACCAAA 800 

ACTGGTTCTA TCACCTCCG 819 



50 



60 



2 ) INFORMATION FOR SEQ ID NO : 365 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 802 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

421 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 100 

CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 150 

10 GAACACCCGA TCGAAGTCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 200 

GAACGTACTG GGTGAACCGG TCGACATGAA AGGCGAGATC GGTGAAGAAG 250 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 300 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 350 

GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 400 

15 AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG AC TC C AACGT TATCGACAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA AC CGTCTGCG CGTTGCTCTG 600 

ACCGGTCTGA CCATGGCTGA G AAATTC CGT GACGAAGGTC GTGACGTTCT 650 

20 GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT 700 

' CCGCACTGCT GGGC CGT ATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 800 

mm 802 

TT 



25 



2 ) INFORMATION FOR SEQ ID NO: 3 66 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 7 85 bases 
_(.b.) ITYPJET: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 ( ii ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 66 

TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150 

45 CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 2 00 

ATGTACTAGG TGAAACAATT GAC CTTAAAG AAGAAATTAG TGATTCTGTT 2 50 

CGCCGCGATC CTATCCATCG TCAAGCACCA GCATTCGATG AACTTTCAAC 300 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TT AC TAGC AC 35 0 

C TT AT ATT AA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT 400 

50 AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 450 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTAC TCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGCCATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 65 0 

55 TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 7 00 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 7 85 



60 
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2 ) INFORMATION FOR SEQ ID NO: 3 67 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aurlcularxs 

(B) STRAIN: ATCC 33753 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 67 



20 



25 



30 



TCGATGTTGA 


50 


TTACAATTAG 


100 


TGGTGTTAAA 


150 


TACCAGTCGG 


200 


ACAATTGATT 


250 


ACATAGAGAA 


300 


TAGAAACAGG 


350 


GGTAAAGTTG 


400 


CCAAGAATTA 


450 


TTGCCGGTGT 


500 


ATGAGCGACA 


550 


GAACGAACCA 


600 


TGGCTGAATA 


650 


GACAATATTT 


700 


AGGTCGTTTA 


750 


TGGGACAATT 


800 


TCA 


843 



35 



2 ) INFORMATION FOR SEQ ID NO: 3 68 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 849 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 368 

GC TTTGAAC A TAATGAAGTT CCTGATATTA ACAATGCCTT ACACATCGAA 5 0 

GTTCCTAAAG AAGATAGCAC ACTTCATTTA AC TTT AG AAG TTGCACTTCA 100 

55 ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA ACTGACGGCG 150 

TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT TAGCGTACCT 2 00 

GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG GAGAAACAAT 2 50 

CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT CCTATTCATA 3 00 

GACAGGCACC TGGCTTCGAT GAATTATCTA CTAAAGTAGA AATCTTAGAA 3 50 

60 ACAGGTATCA AAGTAGTAGA CTTATTAGCA CCTTACATTA AAGGTGGTAA 400 
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AATTGGATTA TTCGGTGGTG CCGGTGTTGG 
AACTTATCAA TAATATCGCT CAAGAGCATG 
GGTGTTGGTG AACGTACACG TGAAGGTAAC 
CGATAGTGGT GTAATTAAGA AAACAGCGAT 
5 AGCCACCTGG TGCTCGTATG CGTGTTGCAT 
GAATATTTCC GTGATGAAGA AGGCCAAGAC 
TATCTTCAGA TTCACACAAG CTGGTTCTGA 
GTATGCCTTC AGCCGTTGGT TATCAACCAA 
CAATTACAAG AACGTATTAG TTCAACTAAT 



TAAGACAGTT TTAATCCAAG 450 

GTGGTATTTC AGTATTCGCC 500 

GACCTTTACT ATGAAATGAG 5 50 

GGTATTCGGT CAGATGAACG 600 

TATCAGGTTT AACAATGGCA 650 

GTATTATTAT TCATTGATAA 7 00 

AGTTTCAGCA TTACTTGGAC 7 50 

CACTTGCTAC TGAAATGGGT 800 

AAAGGTTCTG TTACTTCAA 849 



2 ) INFORMATION FOR SEQ ID NO : 3 69 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DMA 



20 



{ vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnxx 
25 (B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

G AAGTTC C AG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 50 

3 0 AGGT AC TGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 100 

" TAGTACGAAC AATCGCAATG GATTCAACTG AT^TCTTAA . ACGTGGTAC A 150 

GAAGTTAGAG ATAGCGGAAA T AGTATC AG C GTACCAGTTG GTAATGAAAC 200 

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 250 

ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 300 

35 TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 350 

CATTGATTTA TTAGCACCAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 400 

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 450 

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGCTGGTG TAGGTGAGCG 500 

TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

40 TTAAAAAGAC AGC TATGGT A TTTGGACAAA TGAACGAACC ACCTGGTGCG 600 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 650 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 7 00 

CTCAAGCTGG TTCAGAAGTT TC TGCGTTAT TAGGTCGTAT GCCTTCAGCT 750 

GTTGGTTACC AACCAACGTT AGCAACTGAA ATGGGACAAT TACAAGAACG 800 

45 TATTACTTCT ACAACTAAAG GTTCAGTAAC 830 



50 



2 ) INFORMATION FOR SEQ ID NO: 370 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 87 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Staphylococcus epldeirmldls 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 

AATAATGCAT TACACATCGA AGTTCCTAAA GAAGATGGAG CGCTTCAATT 50 

5 AACATTAGAA GTTGCACTTC AACTAGGTGA CGATGTAGTT CGTACAATTG 100 

CAATGGACTC AACTGACGGC GTTCAAAGAG GAATGGAAGT TAAAGATACA 150 

GGT AG AG AC A TAAGTGTACC TGTCGGTGAC GTAACTC TAG GAAGAGTGTT 2 00 

TAACGTACTA GGAGAAACTA TTGACTTAGA TGAAAAAATT GATGATTCAG 250 

TACGACGTGA CCCTATCCAT AGACAAGCTC CAGGATTCGA CGAATTATCA 3 00 

10 ACAAAAGTAG AAATC TTAG A AACTGGTATT AAAGTAGTAG AC TT ATTAGC 350 

ACCTTACATA AAAGGTGGTA AAATTGGATT ATTTGGTGGT GCCGGTGTAG 400 

GTAAAACCGT ACTAATCCAA G AAC TT ATT A ATAACATCGC TCAAGAACAC 450 

GGTGGTATCT CAGTATTCGC TGGTGTTGGT GAACGTACAC GTGAAGGTAA 500 

TGATCTTTAC TATGAAATGA GTGACAGTGG TGTTATCAAG AAAACTGCAA 550 

15 TGGTCTTTGG TCAAATGAAT GAGCCACCTG GTGCACGTAT GCGTGTAGCA 600 

TTATCCGGAT TAACAATGGC CGAATATTTC CGAGATGAAG AAGGCCAAGA 650 

TGTGTTATTA TTCATTGATA ACATTTTCAG ATTCACTCAA GCTGGTTCAG 7 00 

AAGTTTCTGC GTTATTAGGT CGTATGCCAT CAGCTGTTGG TTATCAACCT 7 50 

ACACTTGCTA CAGAAATGGG TCAATTACAA GAACGTA 7 87 

20 

2) INFORMATION FOR SEQ ID NO : 371 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolytlcus 
35 (B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 

GAAGTACCTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

40 TGGT AC TTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 100 

TTGTTCGTAC AATTGCTATG GATTCAACAG ATGGTGTTCA ACGTGGTATG 150 

GAAGTTCAGA AC AC TGG AAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 2 00 

TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 2 50 

AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 3 00 

45 TTTG AC G AAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 3 50 

TGT AG AC TT A TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 400 

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 450 

ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

TAC AC GTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

50 TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC AC CTGGTGC A 600 

CGTATGCGTG TGGC AC TTTC TGCATTGACA ATGGCTGAGT ATTTCCGTGA 650 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 7 00 

CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 7 50 

GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TAC AAGAAC G 800 

55 TATTACATCA ACGAATAAAG GTTCAGTAAC 830 



2 ) INFORMATION FOR SEQ ID NO : 372 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: , . u« mn -„,- e 

(A) ORGANISM: Staphylococcus hommis subsp. homxnis 

(B) STRAIN: ATCC 27844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 

15 TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT TAAATTAACA TTAGAAGTTG CATTGCAACT 100 

SSwSmgat GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

aacgtggtat GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 200 

PTTCAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

20 ?S2£cg1£ aaaaSgata GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

20 £™J5SJ* n££nr-ATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 3 50 

GGTATTAAAG TTGT AG AC TT ATTAGCACCT TATATTAAAG GTGGTAAGAT 400 

TGGTTTATTC GGTGGTGC CG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 450 

EXJSSSX TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 500 

?s gtIggtgaac gtactcgtga AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

taSSgtc ATGAACAAGG TCAAGATGTA CTTTTATTCA ttgacaatat 700 

TTTCC GCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 

30 TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 800 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 



35 2 ) INFORMATION FOR SEQ ID NO: 373 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 
40 <c) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: , , 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 175 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 



TATTGAAGTA 


50 


CATTGCAACT 


100 


GATGGTGTTC 


150 


TGTTCCTGTA 


200 


AAACAATAGA 


250 


ATTCATCGTC 


300 


TCTTGAAACA 


350 


GTGGTAAGAT 


400 


ATTCAAGAAT 


450 


ATTCGCTGGT 


500 
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GTAGGTGAAC GTACTCGTGA AGGTAACGAT 
TAGTGGCGTT ATCAATAAAA CAGCCATGGT 
CGCCAGGTGC GCGTATGCGT GTTGCTTTAT 
TATTTCCGTG ATGAACAAGG TCAAGATGTA 
5 TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT 
TGCCTTCAGC TGTAGGTTAT CAACCTACAT 
TTACAAGAAC GTATTACATC TACTAATAAA 



TTATACTATG AAATGAGCGA 550 

ATTTGGGCAA ATGAATGAGC 600 

CAGCATTGAC AATGGCTGAA 650 

CTTTTATTCA TTGACAATAT 7 00 

TTCAGCATTA TTAGGACGTA 750 

TAGCAACTGA AATGGGTCAA 800 

GGTTCAGTCA CTTCAA 846 



10 

2 ) INFORMATION FOR SEQ ID NO : 37 4 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 835 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensxs 

(B) STRAIN: ATCC 43 809 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 

ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 5 0 

AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100 

CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150 

30 GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 200 

G AAAC ATT AG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 2 50 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 300 

CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTC TTG A AACTGGTATT 350 

AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 400 

35 GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 450 

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 500 

GAACGTACAC GTGAAGGTAA TGACTTATAT TATGAAATGA GCGATAGTGG 55 0 

CGTAATTAAG AAAACAGCGA TGGTATTTGG CCAAATGAAT GAACCACCTG 600 

GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 650 

40 CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTATCGATA AT ATATTC CG 7 00 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 7 50 

CTGCCGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 83 5 



2 ) INFORMATION FOR SEQ ID NO: 37 5 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 15305 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37 5 

GAGCACAATG AAGTTCCAGA AATTAACAAT GCCTTAGTCG TAGACGTTGA 50 

AAGAGATGAA GGTACAGTAT C TC TT AC ATT AGAAGTGGCA TTACAACTTG 100 

5 GCGATGATGT C GTACGTAC A ATTGCAATGG ATTCTACTGA TGGTGTTAAA 150 

CGTGGTACAG AAGTTCGAGA TAGCGGAGAT AGCATCAGTG TTCCAGTTGG 2 00 

TGATGCTACG TTAGGACGTG TGTTTAATGT TCTTGGTGAT ACAATTGACT 250 

TAGACGAGAA GCTTGATACT TCTGTCAAAC GTGATCCAAT TCATAGAGAA 300 

GCACCTGCAT TCGATCAATT ATCAACAAAA GTTGAAATCT TAGAAACAGG 350 

10 TATTAAAGTA ATTGATTTAC TTGC AC C ATA TATTAAAGGT GGTAAAATCG 400 

GTTTATTCGG TGGCGCTGGT GTAGGTAAAA CAGTATTAAT TCAAGAATTA 450 

ATTAATAATA TAGCTCAAGA ACATGGTGGT ATTTCAGTAT TTGCCGGCGT 500 

AGGTGAACGT ACGCGTGAAG GTAATGACTT ATACTACGAA ATGAGTGATA 5 50 

GTGGTGTTAT TAAGAAAACA GCTATGGTCT TCGGACAAAT GAATGAGCCA 600 

15 CCTGGTGCGC GTATGCGTGT TGCTTTATCA GGCTTAACAA TGGCTGAACA 650 

CTTC CGTGAT G T AC AAGG AC AAGATGTTTT ACTATTTATT GATAACATAT 7 00 

TCAGATTTAC GCAAGCTGGT TCAGAAGTAT CAGCACTATT AGGTCGTATG 750 

CCATCAGCCG TTGGTTATCA ACCTACCCTT GCTAC TG AAA TGGGTCAATT 800 

ACAAGAACGT ATTACATCAA CAACTAAAGG ATCTGTAACG TC 842 

20 

2 ) INFORMATION FOR SEQ ID NO: 37 6 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



_( ii ) MOLECULE !"^rYPE :~ jGenom^ DNA — 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 
35 (B) STRAIN: ATCC 27848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 

TGATGAACTG CCTAAGATTA ATAACGCATT AGTGC TAG AT GTACCTAAGA 50 

40 AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 100 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 15 0 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 200 

AAGAT AC GTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AATCGATAAT 2 50 

GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC AT AGAC AAC C 3 00 

45 ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 35 0 

TCAAAGTAGT AGACTTATTA GC AC CATATA TCAAAGGTGG TAAAGTTGGT 400 

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 450 

TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 500 

GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

50 GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 7 00 

CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GC CGTATGCC 750 

ATCAGCCGTT GGTTACCAAC CAACATTGGC AAC AG AAATG GGTCAATTAC 800 

55 AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 



60 



2 ) INFORMATION FOR SEQ ID NO : 377 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

{ vi ) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 

15 CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 5 0 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 100 

ATGATGTTGT ACGTACAATT GCAATGGATT CAACTGATGG TGTTCAAAGA 150 

GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 200 

TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 250 

2 0 AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 3 00 

CCAGGTTTCG ATGAATTATC TACTGAAGTA GAAATCTTAG AAACAGGTAT 3 50 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 4 00 

TATTCGGTGG TGCCGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 450 

AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

2 5 TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 55 0 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 7 00 

GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 7 50 

3 0 TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA . 800 

AGAACGAATT ACATCTACAA-ATAAAGGTTC TGTAACATCT A 841 



3 5 2 ) INFORMATION FOR SEQ ID NO: 37 8 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominlmus 

(B) STRAIN: ATCC 51726 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 



TTTAACACGA ATGAACCGCT TCCTGAGATA AATAATGCAC TTGTTGTTTA 50 

CAAAGACAGT GAGAAAAAAC ATAAAATCGT TCTTGAAGTA GCTCTTGAAC 100 

TTGGTGAAGG CCTCGTTCGT AC C ATTGCTA TGGAATCAAC TGATGGTTTG 150 

AC AC GTGGTC TAGAAGTTCT TGATACAGGC CGTGCAATCA GTGTACCAGT 2 00 

55 TGGTAAAGAA ACGCTTGGAC GTGTCTTCAA CGTTCTTGGT GATGCTATCG 2 50 

ATCTTGAAGA ACCATTTGGA GAAGATGCAG AACGTCACCC CATTCATAAG 3 00 

AGTGCCCCAA CTTTTGATGA ATTATCAACG TCAACAGAAA TCCTTGAAAC 3 50 

AGGGATTAAA GTTATCGACC TACTTGCCCC TTACTTAAAA GGAGGGAAGG 4 00 

TTGGACTTTT CGGTGGTGCC GGAGTTGGTA AGACCGTTCT TATCCAAGAG 450 

60 TTGATTCATA ACATTGCTCA AGAGCATGGT GGTATTTCAG TATTT AC CGG 5 00 

429 
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AGTTGGTGAA CGTACACGTG 
AATCAGGCGT TATTGAAAAA 
CCACCTGGTG CACGTATGCG 
ATATTTCCGT GATGTTGAAG 
5 TTTTTCGTTT CACACAAGCA 
ATGCCATCAG CCGTTGGTTA 
ATTGCAAGAA CGTATCACGT 



AAGGTAATGA CCTCTATTGG 

ACAGCTATGG TATTTGGTCA 

TGTAGCCCTT ACTGGTTTGA 

GACAGGACGT GCTTCTCTTT 

GGTTCTGAAG TTTCAGCTCT 

TCAACCAACC TTGGCAACTG 

CAACTAAAAA AGGTTCTGTT 



GAAATGAAAG 550 

GATGAATGAG 600 

CAATCGCTGA 650 

ATTGATAACA 7 00 

TCTTGGACGT 7 50 

AAATGGGTCA 800 

ACATCA 846 



10 

2 ) INFORMATION FOR SEQ ID NO : 379 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 846 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12403 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTC TAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA C TTG AAGTTA CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

30 CACGTGGTTT GGAAGTATTA G ATAC TGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA XTTTGGGTCG jraTCTTCAAC. GTTCTTGGAG ATGCTATTGA 2 5 0 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

3 5 TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 5 00 

GTAGGAGAAC GTAC TCGTG A AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

40 TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 7 00 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 7 50 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 8 00 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



45 



2 ) INFORMATION FOR SEQ ID NO : 3 80 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactlae 

(B) STRAIN: ATCC 13813 

60 
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(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 380 



TTGCAAGTGG 
AAAAATGGCG 
5 TGGTGACGGC 
CACGTGGTTT 
GGTAAGGATA 
TCTTGAAGAG 
AAGCACCATC 

10 GGTATTAAAG 
TGGACTTTTC 
TAATCCACAA 
GTAGGAGAAC 
ATCTGGCGTT 

15 CACCAGGAGC 
TACTTCCGTG 
CTTCCGTTTC 
TGCCTTCAGC 
TTACAAGAGC 

20 



CGACAAACTT 
ATAAGTCACA 
CTCGTTCGTA 
GGAAGTATTA 
CTTTGGGTCG 
CCTTTTGCAG 
GTTTGATGAA 
TTATTGACTT 
GGTGGTGCGG 
CATCGCCCAA 
GTACTCGTGA 
ATTGAAAAAA 
ACGTATGCGT 
ATGTAGAAGG 
ACACAAGCTG 
CGTTGGTTAT 
GTATCACTTC 



CCTGAGATTA 
AAAAGTAGTA 
CAATCGCTAT 
GATACTGGTC 
TGTCTTCAAC 
AAGATGCAGA 
TTATCAACAT 
ATTAGCACCT 
GTGTTGGTAA 
GAACATGGTG 
AGGGAATGAC 
CGGCTATGGT 
GTGGCACTTA 
ACAAGATGTG 
GGTCAGAAGT 
CAACCAACAC 
AACTAAAAAA 



ATAATGCATT 

CTTGAAGTTG 

GGAATCAACT 

GCGCAATTAG 

GTTCTTGGAG 

ACGTCAACCA 

CATCAGAAAT 

TACTTAAAAG 

AACCGTTCTT 

GTATTTCAGT 

CTTTATTGGG 
CTTTGGTCAA 

CTGGTC TT AC 
CTTCTCTTCA 
GTCAGCGCTT 
TTGCTACAGA 
GGTTCTGTTA 



G ATTGTC TAT 50 

CTCTTGAACT 100 

G ATGGGC TT A 150 

TGTGCCGGTT 2 00 

ATGCTATTGA 2 50 

ATCCATAAAA 3 00 

C TT AG AAACT 3 50 

GTGGTAAAGT 400 

ATTCAAGAAT 4 50 

ATTTACTGGT 500 

AAATGAAAGA 550 

ATGAATGAAC 600 

AATAGCTGAG 650 

TTGATAATAT 7 00 

TTAGGTCGTA 7 50 

AATGGGACAA 800 

CCTCAA 846 



2 ) INFORMATION FOR SEQ ID NO: 381 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE : Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
35 (B) STRAIN: ATCC 12973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT G ATTGTC TAT 50 

40 AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT G ATGGGC TTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GTGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

45 AAGCACCATC ATTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 3 50 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

50 ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTC TT AC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 7 00 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 7 50 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

55 TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCA 845 



2) INFORMATION FOR SEQ ID NO: 3 82 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382 

15 TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGC TTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

2 0 CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAG CAC CATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 35 0 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTAC TGGT 500 

25 GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 7 00 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 7 50 

3 0 TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 845 



35 2 ) INFORMATION FOR SEQ ID NO : 383 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC SS1073 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

55 GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAG CAC CATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 450 

60 TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

432 



CA 02307010 2000-05-19 



10 



20 



45 



GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT C TTTGGTC AA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 65 0 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 7 00 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 84 5 



2 ) INFORMATION FOR SEQ ID NO: 384 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 845 bases 
15 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43078 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 

TTGCTAGTGG GGACAAACTT C C AG AG ATT A ATAATGCATT GATTGTTTAT 50 

AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 100 

TGGTGACGGT ATGGTGCGAA CAATCGCTAT GG AATCAACT GATGGGCTTA 150 

3 0 CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGT AC CAGT A 2 00 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT G T AC TTGG AG AAACCATTGA 250 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT TCTTGAAACT 3 50 

GGTATCAAGG TAATTGACCT TCTTGCCCCT TACCTTAAAG GTGGTAAAGT 4 00 

35 TGG AC TTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 500 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 550 

ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 600 

CGCCTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

40 TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 7 00 

CTTCCGTTTC ACTCAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 75 0 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATCTGTTA CTTCT 84 5 



2 ) INFORMATION FOR SEQ ID NO : 385 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp. equi 

(B) STRAIN: ATCC 952 8 

60 

433 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 

TTGCGAGTGG GGACAAACTA C C AG AG ATT A ATAATGCGTT GATAGTTTAT 50 

AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 100 

S TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 150 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCC ATT AG TGTACCAGTT 2 00 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 2 50 

CCTAGAAGCA CCATTTGCAG ATGATGTTAA TCGTGAACCG ATCCATAAAA 300 

AAGCACCAGC CTTTGATGAA TTGTCAACAT CATCAGAAAT TCTTGAAACA 3 50 

10 GGTATCAAGG TTATTGACCT GCTTGCCCCT TACTTAAAGG GTGGTAAGGT 4 00 

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 450 

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTAC CGGT 500 

GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550 

ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 6 00 

15 C AC C AGGAGC CCGTATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 650 

TATTTCCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 7 00 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 800 

TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 846 

20 



2 ) INFORMATION FOR SEQ ID NO : 3 86 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 



30 



-(i i )-MOLECULE— TYPE-: - Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 
35 (B) STRAIN: ATCC 27335 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 386 

TTGCAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 50 

40 AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 100 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC G AC C AATTTC TGTTCCAGTT 2 00 

GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATACCATTGA 2 50 

TTTGGATACT CCATTCGGCG AAG ATGC AG A ACGTCAACCA ATCCATAAAA 300 

45 AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 3 50 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 400 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 500 

GTTGGGGAAC GAAC TCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 550 

50 ATC TGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 600 

CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 650 

TATTTCCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 7 50 

TGCCATCAGC TGTTGGTTAC CAACCAACCT TGGCTACTGA AATGGGGCAA 800 

5 5 TTACAAGAAC GTATTACATC AACGAAAAAA GGTTCTGTTA CCTCAA 846 



2 ) INFORMATION FOR SEQ ID NO: 3 87 

60 

434 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

( vi ) ORIGINAL SOURCE: 
10 ( A ) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7 073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 

1* GCAGCTGGTG ATAAACTTCC TGAGATTAAC AATGCATTGG TCGTTTATAC 50 

tca?§2S£a AAGTCTAAAC GTATCGTGCT CGAAGTAGCT CTTGAACTTG 100 

^SS^TGT GGTTCGTACC ATTGCCATGG AATCTACTGA TCGATTGACT 150 

CGTGGACTAG AAGTTC TGG A CACTGGTCGT CCAATCAGCG TTCCTGTTGG 2 00 

SSSgatacc CTTGGACGTG TCTTTAACGT TCTTGGTGAT ACCATTGACT 250 

20 ?SS5agcacc TTTTGCAGAC GATGCAGAGC GTGAACCAAT TCACAAAAAA 300 

^ScAACTT TCGATGAATT GTCAACATCT AC TG AAATCC TTGAAACAGG 350 

GATTAAAGTT ATCGACTTGC TAGCCCCTTA CCTTAAGGGT GG T AAAGTCG 400 

GACTCTTCGG TGGTGCCGGT GTTGGTAAAA CCGTTCTTAT TCAAGAGTTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCCGTGT TTACAGGTGT 500 

25 TGGTGAACGT ACACGTGAAG GTAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAAAT GAACGAACCA 600 

CCTGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCGGAATA 650 

CTTCCGTGAT GTCGAGGGTC AAGACGTTCT TCTCTTCATC GATAACATCT 7 00 

TCCGTTTCAC TCAAGCAGGT TCTGAGGTTT CTGCCCTTCT TGGTCGTATG 7 50 

30 CCATCAGCCG TTGGTTACCA AC CTAC ACTT GC T AC TG AAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAAAAAGG TTCTGTTACA • TCT 843 



35 2 ) INFORMATION FOR SEQ ID NO: 388 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 



TTGCAGCAGA AG ATAAAC TT CCTGAGATTA ACAACGCACT CGTTGTATAT 50 

AAAAATGATG ATTCCAAACA AAAAGTCGTG CTTGAAGTGG CTTTGGAACT 100 

TGGTGATGGC GTTGTACGGA CCATTGCCAT GGAATCAACG GATGGATTGA 150 

CACGTGGGAT GG AAGTTC TC GATACAGGTC GTCCCATCTC TGTTCCAGTC 200 

55 GGTAAAGAAA CGCTGGGTCG TGTCTTCAAT GTGTTGGGAG ATACCATTGA 2 50 

C CTTGAAG AG TCTTTTCCGG CAGATTTTGA ACGTGAGCCT ATCCATAAGA 3 00 

AAGCGCCGGC TTTTGACGAA TT ATC T AC TT CAAGCGAAAT TTTGGAAACA 350 

GGGATTAAGG TTATCGACCT CCTAGCACCT T ATC TAAAAG GTGGTAAGGT 4 00 

TGGTCTCTTC GGTGGTGCTG GTGTTGGTAA AACCGTTCTT ATCCAAGAAT 450 

60 TGATTCACAA TATTGCCCAA GAACACGGTG GTATCTCTGT ATTTACCGGA 500 

435 
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GTTGGCGAGC GTACCCGTGA AGGGAACGAT 
ATCAGGTGTT ATTGAAAAAA CGGCCATGGT 
CACCAGGAGC CCGTATGCGT GTTGCTCTTA 
TACTTCCGTG ATGTGGAAGG GCAGGATGTT 
5 CTTCCGTTTC ACACAGGCTG GTTCAGAAGT 
TGCCATCAGC CGTTGGTTAT CAGCCAACAC 
TTGCAGGAGC GTATTACCTC AACCAAGAAG 



CTTTACTGGG AAATGAAAGA 550 

ATTTGGTCAG ATGAATGAGC 600 

CTGGTTTGAC TATTGCGGAA 650 

CTTCTGTTCA TCGATAATAT 7 00 

GTCTGCCCTC TTGGGTCGTA 7 50 

TTGCGACGGA GATGGGACAA 800 

GGTTCTGTTA C 841 



10 2 ) INFORMATION FOR SEQ ID NO: 3 89 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 844 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

' vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uteris 

(B) STRAIN: ATCC 1943 6 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 

GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 100 

GGGACGGAAT GGTTCGTACA ATCGCTATGG AATCAACTGA TGGGCTTACA 150 

3 0 CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TACCAGTCGG 2 00 

AAAAGAAACT TTGGGTC GTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 2 50 

TGGATGAACC ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 3 00 

GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 350 

AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 400 

35 GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 450 

ATTCATAATA TTGC AC AAG A ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 550 

CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAACCA 600 

CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 650 

40 TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 700 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCCCTATT GGGTCGTATG 7 50 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 800 

GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA 844 



45 



60 



2 ) INFORMATION FOR SEQ ID NO: 390 



{ i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 

(B) STRAIN : ATCC 33301 



436 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 

TTCCCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 50 

TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

5 TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 150 

AAGGTG AC CG ATCTGCAAAA ACCGATTCAG GTACCGGTCG GTAAAGCGAC 2 00 

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250 

ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTG AAC TC TATTCACCGT 3 00 

GCAGCGCCAA GCTACGAAGA ACTGGCTAAC TCTACTGAGC TGCTGGAAAC 350 

10 GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 400 

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 450 

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 5 00 

GGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGACCG 550 

AGTCTAACGT TC TGG AT AAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600 

15 CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650 

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GATAACATCT 7 00 

ATCGTTATAC CCTGGCCGGT AC TG AAGTTT CAGCACTGCT GGGTCGTATG 7 50 

CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 800 

TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 896 



20 



2 ) INFORMATION FOR SEQ ID NO : 391 

2 5 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trabulslella guamensis 

3 5 (B) STRAIN: ATCC 49490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

4 0 TGG TAGTG AG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 100 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 300 

45 GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAAC GT AC 5 00 

TCGTGAGGGT AACGACTTCT AC C AC G AAAT GACCGACTCC AACGTTCTGG 55 0 

50 ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTACCGT TACACCCTGG 700 

CGGGTAC TG A AGTATCTGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 800 

55 CTCAACCAAA ACCGGTTCTA TCACCTCCG 829 
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2 ) INFORMATION FOR SEQ ID NO : 392 

437 
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10 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNE S S : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia bercovieri 

(B) STRAIN: ATCC 43970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 



15 CGAATTCCCC 
AAGGCACAGC 
GGTGTTGTTC 
GTTGAAAGTC 
CAACTCTGGG 

20 GGTCCTATCG 
TTACGAAGAG 

xrin J. OO^A 1C1 

GGTGGTGCGG 
TATTGCGATT 

25 GTACTCGTGA 
CTGGACAAAG 
CCGTCTGCGC 
ATGAAGGTCG 
CTGGCCGGTA 

3 0 AGGCTATCAG 
TCACTTCCAC 



CAAGACGCTG 
TCAGAAGCTG 
GTTGTATCGC 
ATCAACCTGG 
CCGTATCATG 
GTGAAGAAGA 
CTTGCCAGCT 
GATTTGTCCG 
GTGTGGGTAA 
GAGCACTCAG 
GGGTAACGAC 
TATCCTTGGT 
GTTGCACTGA 
TGATGTACTG 
CAGAGGTATC 
CCAACGCTGG 
CAAGACGGGT 



TACCAAAAGT 
GTGCTGGAAG 
AATGGGCTCT 
AACACCCAAT 
AACGTATTGG 
GCGTTGGGCA 
CGCAAGATCT 
TTCGCTAAGG 
AACAGTCAAC 
GTTATTCTGT 
TTCTACCACG 
TTATGGCCAG 
CCGGCTTGAC 
TTATTCATCG 
TGCACTGCTA 
CAGAAGAGAT 
TCAATCACCT 



GTACAACGCC 
TTCAGCAACA 
TCCGATGGTC 
TGAAGTGCCG 
GTGACCCAAT 
ATCCACCGCG 
GTTAGAAACC 
GCGGTAAAGT 
ATGATGGAGC 
ATTTGCCGGT 
AGATGACTGA 
ATGAATGAGC 
CATGGCGGAG 
ATAACATCTA 
GGTCGTATGC 
GGGTGTGTTG 
CCGTA 



CTTGAGGTTG 50 

GCTGGGCGGT 100 

TGAGCCGCGG 150 

GTGGGTAAAT 200 

CGACATGAAA 2 50 

AAGCGCCTTC 3 00 

GGTATCAAGG 350 

CGGTCTGTTC 400 

TGATTCGTAA 450 

GTGGGTGAGC 500 

CTCCAACGTT 5 50 

CACCAGGTAA 600 

AAATTCCGTG 650 

TCGTTATACC 7 00 

CATCAGCGGT 750 

CAGGAACGTA 800 
835 



35 2 ) INFORMATION FOR SEQ ID NO: 393 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 
40 ( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9 610 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA ■ 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 200 

55 CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 250 

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC x 300 

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 35 0 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACGGT AAACATGATG GAGCTTATTC GTAACATTGC GATTGAGCAC 450 

60 TCAGGTTATT CCGTATTTGC TGGCGTGGGT GAGCGTACTC GTGAGGGTAA 500 

438 
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10 



20 



45 



60 



CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTTTATGG CCAAATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600 

CTGACCGGCT TGACCATGGC G G AG AAATTC CGTGATGAAG GTCGTGACGT 650 

ATTGCTGTTC ATCGATAACA TCTATCGCTA TACCTTAGCC GGTACGGAAG 7 00 

TTTCCGCACT GCTGGGTCGT ATGCCATCTG CCGTAGGTTA CCAGCCAACG 750 

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACCAAGAC 800 

GGGTTCAATC AC 812 



2 ) INFORMATION FOR SEQ ID NO: 394 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 802 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33 641 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394 

AAAGTGTACA ACGCCCTTGA GGTTGAAGGT AC TGCTGAG A AGTTAGT AC T 50 

GGAAGTTCAG CAACAGCTGG GCGGTGGTGT TGCTCGTTGT ATCGCCATGG 100 

GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGTCAA CC TGGAAC AC 150 

3 0 CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 200 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 2 50 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 300 

GATCTGTTAG AAAC CGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 3 50 

TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 4 00 

35 TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAGCA CTCAGGTTAT 450 

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 55 0 

GCCAGATGAA TGAGCCACCA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 600 

CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 65 0 

40 CATCGATAAC ATCTATCGTT AT AC C TTGGC CGGTACGGAA GTATCCGCAC 700 

TGC TGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 750 

GAGATGGGTG TGTTGCAGGA AC GT ATT ACT TCCACCAAGA CGGGTTCAAT 800 

CA 802 



2 ) INFORMATION FOR SEQ ID NO: 395 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 

(B) STRAIN: ATCC 29909 



439 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 95 

GCTGTACCAA GAGTGTACAA CGCCCTTGAG GTTGAAGGCA CTGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTAGG CGGTGGTGTT GTTCGTTGTA 100 

5 TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GCGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCGGTTGGT AAATCAACTC TGGGCCGTAT 200 

CATGAACGTA TTGGGTG AC C CAATCGACAT GAAAGGTCCT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCAATCCAC CGCGAAGCGC CTTCTTACGA AGAGCTTGCC 300 

AGCTCACAAG ATTTGTTAGA AACCGGTATC AAAGTAATGG ACTTGATTTG 350 

10 CCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 400 

~" GTAAAACAGT AAACATGATG GAGCTTATTC GTAACATCGC GATTGAGCAC 45 0 

TCAGGTTATT C TGTATTTGC TGGTGTGGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTGTATGG CCAGATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600 

15 CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 65 0 

ACTGTTGTTC ATCGATAACA TCTATCGCTA TACCTTGGCC GGTACGGAAG 7 00 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACG 7 50 

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACGT CCACCAAGAC 800 
GGGTTC 

20 



806 



2 ) INFORMATION FOR SEQ ID NO : 396 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



( i i ) MOLECULE TYPE : Genomi c DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 
35 (B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 

GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 5 0 

40 GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 150 

CTGGAACACC CGATCGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 2 00 

CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 25 0 

AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 300 

45 AGCTCACAAG ATC TGTT AG A AACCGGTATC AAGGTTATGG ACCTGATTTG 3 50 

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAG CGTAC CC GTGAGGGTAA 500 

TGACTTCTAC CATGAAATGA CTGACTCCAA C GTTTTGG AC AAAGTATCCT 550 

50 TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700 

TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGCCAACA 75 0 

CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT C C ACT AAGAC 800 

5 5 GGGTTC 806 



2 ) INFORMATION FOR SEQ ID NO: 397 

60 

440 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

( C ) STRAND EDNES S : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397 

15 TTCCCCCAAG ACGCTGTACC AAAAGTGTAC AACGCCCTTG AGGTTGAAGG 50 

TGCAGCTGAG AAGC TTGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TTGTTCGTTG TATCGCAATG GGCTCTTCCG ATGGTTTGAG CCGTGGGTTG 150 

AAAGTTATCA ACCTGGAACA CCCAATTGAA GTGCCAGTTG GTAAATCAAC 200 

TCTGGGCCGT ATCATGAACG TATTGGGTGA CCCAATCGAC ATG AAAGGC C 2 50 

2 0 CTATCGGTGA AGAAGAGCGT TGGGCAATCC ACCGTGAAGC GCCTTCTTAC 3 00 

GAAGAG CTTG CCAGCTCGCA AGATCTGTTA GAAACCGGTA TCAAGGTAAT 350 

GGATCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAGCTTAT TCGTAACATC 450 

GCGATTGAGC ACTCAGGTTA TTCTGTATTT GCCGGGGTAG GTGAACGTAC 5 00 

25 TCGTGAGGGT AACGACTTCT ACCACGAGAT GACTGACTCC AACGTTCTGG 550 

ACAAAGTATC CTTGGTTTAT GGCCAGATGA ATGAGCCACC AGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CTTG AC C ATG GCGGAAAAAT TCCGTGATGA 650 

AGGC CGTG AC GTATTGCTGT TCATCGATAA CATTTATCGT TATACCCTAG 7 00 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCATC TGCGGTAGGC 75 0 

3 0 TATCAGCCAA CACTGGCAGA AGAGATGGGT GTGTTGCAGG AACGTATTAC 800 

TTCCACTAAG ACGGGTTCAA TCACCTCCG 82 9 



3 5 2 ) INFORMATION FOR SEQ ID NO : 398 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN : ATCC 35313 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398 



ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAAAA TGGTAACGAG 50 

AAAC TGGTGC TGGAAGCTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC CCTGGGCCGT 2 00 

5 5 ATCATGAACG TCCTGGGTCA GCCGATCGAC ATGAAAGGCG ACATCGGTGA 2 50 

AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCTTCCTAT GAAGAGCTGT 3 00 

CCAGCTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTAAT GGATCTGATC 3 50 

TGCCCGTTCG CTAAGGGTGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACT GTAAACATGA TGGAGCTTAT CCGTAACATC GCGATTGAGC 450 

60 ACTCCGGTTA CTCTGTGTTT GCAGGCGTGG GTGAACGTAC TCGTGAGGGT 500 
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10 



20 



50 



AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG ATAAAGTATC 550 

CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGCCGTGAC 65 0 

GTTCTGCTGT TCGTCGATAA CATCTACCGT TATACCCTGG CCGGTACGGA 7 00 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC TGCGGTAGGT TATCAGCCAA 750 

CTCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC CTCTACCAAA 800 

ACCGGTTCTA TCACCTCCG 819 



2 ) INFORMATION FOR SEQ ID NO : 399 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1097 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 3 8295 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 

AAGCTTAAGG CTGAGCGAGA GCGAGGTATC ACCATTGATA TCGCTCTCTG 5 0 

GAAGTTCCAG ACCCCTAAGT ACTACGTCAC CGTTATTGAT GCTCCCGGTC 100 

ACCGAGATTT CATCAAGAAC ATGATTACCG GTACTTCCCA GGCTGACTGC 15 0 

3 0 GCCATCCTCA TCATTGCTGG TGGTGTTGGT GAGTTCGAGG CTGGTATCTC 2 00 

"CAAGGATGGT— CAGACGGGAG -AGCACGCCCT GCTCGCTTTC ACCCTCGGTG 2 50 

TTAAGCAGCT CATTGTTGCT ATCAACAAGA TGGACTCCGT CAAGTGGTCT 3 00 

CAGGATCGAT ACCTCGAGAT TTGCAAGGAG ACTGCCAACT TCGTCAAGAA 350 

GGTCGGTTAC AACCCCAAGG CTGTCCCCTT CGTCCCCATT TCCGGATGGA 400 

3 5 ACGGTGACAA CATGATCGAG CCCTCTACCA ACTGTGACTG GTACAAGGGA 450 

TGGACCAAGG AGACCAAGGC CGGCGAGATC AAGGGTAAGA CCCTCCTCGA 500 

GGCCATTGAT GCCATTGAGC CCCCCGTGCG ACCCCACGAC AAGCCCCTCC 550 

GACTTCCCCT CCAGGATGTC TACAAGATCG GTGGTATCGG CACAGTGCCC 600 

GTTGGCCGAG TCGAGACCGG TGTTATCAAG GCCGGTATGG KTGTT AC CTT 650 

40 CGCTCCCGCC AACGTGACCA CTGAGGTCAA GTCTGTCGAG ATGCACCACG 7 00 

AGATCCTCCC CGATGGAGGT TTCCCCGGTG ACAACGTCGG TTTCAACGTC 7 50 

AAGAACGTTT CCGTCAAGGA TATCCGACGA GGTAACGTTG CTGGTGACTC 800 

CAAGAACGAC CCCCCCAAGG GCTGCGACTC TTTCAACGCT CAGGTCATTG 850 

TTCTTAACCA CCCCGGTCAG ATCGGTGCTG GTTACGCTCC CGTCCTTGAT 9 00 

45 TGCCACACTG CCCACATTGC TTGCAAGTTC GACACCCTGA TCGAGAAGAT 950 

CG ACCGACGA ACCGGTAAGA AGATGGAGGA CTCCCCCAAG TTCATCAAGT 1000 

CTGGTGATGC TGCCATTGTC AAGATGGTTC CCTCCAAGCC CATGTGTGTC 1050 

GAGGCCTTCA CTGAGTACCC CCCTCTTGGT CGATTCGCCG TCCGAGA 1097 



2 ) INFORMATION FOR SEQ ID NO: 400 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 1233 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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CA 02307010 2000-05-19 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidla corymbifera 

(B) STRAIN: ATCC 4 6775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400 



CAAGCTTAAG 
GGAAGTTCGA 

10 CATCGTGATT 
CGGTATCTTG 
CCAAGGATGG 
GTCCGTCAAT 
TGAGGCCCGT 

15 AGATTGGTTT 
AACGGTGACA 
ATGGAACAAG 
AAGCCATTGA 
CGTCTTCCCC 

2 0 TGTCGGTCGT 

TCGCTCCCGC 
GAGCAACTTG 
GAACGTTTCC 
AG AACG AC CC 
25 TTGAACCACC 
CCACACTGCT 
ATYGTCGTTC 
TTAATCTYTG 
AC TTTTTG AT 

3 0 GTAAGAAGTT 

ATCGTCAAGA 
ATATCCTCCT 



GCTGAACGTG 
GACTCCCAAG 
TCATCAAGAA 
ATTATTGCTG 
TCAAACCCGT 
TGATTGTCGC 
TACAACGAAA 
CAACCCCAAG 
ACATGTTGGA 
GAGACTAAGG 
CAACATTGAT 
TTCAAGATGT 
GTTGAGACTG 
TAACGTCACC 
CTGAAGGTGT 
GTCAAGGATA 
CGCCAAGGAA 
CTGGTCARAT 
CACATTGCAT 
CGGTAAGTAA 
GATTTTGCCT 
CATTTGCTAA 
GGAAGACTCC 
TGGTTCCTTC 
CTTGGTCGTT 



AGC GTGGTAT 
TAC CACGTTA 
CATGATTACT 
CTGGTACTGG 
GAACACGCTT 
TATCAACAAG 
TTGTCAAGGA 
TCCGTTCCTT 
GGARTCCACC 
CTGGTGCCAA 
CCCCCTGTTC 
CTACAAGATC 
GTGTCATCAA 
ACTGAAGTCA 
TCCCGGTGAC 
TCCGCCGTGG 
TCCGCTTCCT 
TGGTGCTGGT 
GCAAGTTCTY 
ATANTTTGGT 
CAATTGCTCC 
TCCAAACCCT 
CCCAAGTTCG 
CAAGCCCATG 
TCGCTGTCCG 



CACCATCGAT 
CCGTCATTGA 
GGTACTTCCC 
TGAATTCGAA 
TGCTTGCTTT 
ATGGATTCCA 
AGTCTCCACC 
TCGTCCCTAT 
AACATGCCTT 
GACYGGCAAG 
GTCCTTCCGA 
GGTGGTATTG 
GCCTGGTATG 
AGTCCGTYGA 
AACGTCGGTT 
TAACGTYTGC 
TCACCGCTCA 
TACTCTCCTG 
TKAGCTTCTT 
TTRGGATATG 
TTCC TTGATC 
TTCCATTTYA 
TCAAGWSYGG 
TGCGTTGAAG 
TGA 



ATCGCTCTCT 50 

TGCCCCTGGC 100 

AAGCTG AC TG 150 

GCTGGTATCT 2 00 

CACCCTTGGT 250 

C C AAGT ACTC 300 

TTCATCAAGA 3 50 

CTCTGGCTGG 400 

GGTTCAAGGG 450 

ACCCTTCTTG 5 00 

CAAGCCCCTT 550 

GTACAGTTCC 600 

GTTGTCACCT 650 

AATGCACCAC 7 00 

TCAACGTCAA 75 0 

TCTGACTCCA 8 00 

AGTTATTGTC 850 

TCTTGGATTG 900 

KAGAAGATCG 950 

GGTATTGGGC 1000 

TTTCTCGATT 1050 

TTGAAAACAG 1100 

TGACTCTGCY 1150 

CCTACACTGA 12 00 
1233 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 401 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1151 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 

CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 100 

TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

55 GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 2 00 

GGCCGACTGC GCTATTCTCA TCATTGCCGC C GGTAC TGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGC ACGCTC T CCTCGCTTAC 3 00 

ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 3 50 

CAAGTGGTCC GAGGAGCGTT ACCAGGAGAT CATCAAGGAG ACCTCCAACT 400 

60 TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 450 

443 
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TCCGGTTTCA ACGGTGACAA CATGATTGAG GCCTCATCCA ACTGCCCCTG 500 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 550 

TCCTCGAGGC CATCGACGCC ATCGACCCTY CCAGCCGTCC CACCGACAAG 600 

CCCCTCCGTC TTCCCCTYCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 650 

5 GGTGCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGGCC GGTATGGTCG 7 00 

TCACCTTCGC CCCCGCTGGT GTCACCACTG AAGTCAAGTC CGTCGAGATG 7 50 

CACCACGAGC AGCTCACCGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 800 

CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 85 0 

ACTC CAAGAA CGACCCCCCC AAGGGTGCCG AGTCCTTCAA CGCCCAGGTC 9 00 

10 ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCTGGTTACG CCCCAGTCCT 950 

CGACTGCCAC ACCGCCCACA TTGCTTGCAA GTTCTCTGAG CTCCTCGAGA 1000 

AGATTGACCG CCGTACCGGA AAGTCTGTTG AGAACTCTCC CAAGTTCATC 1050 

AAGTCCGGTG ACGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1100 

CGTTGAGGCT TTCACTGACT ACCCTCCTCT CGGTCGTTTC GCTGTCCGTG 1150 

15 A 1151 



20 



2 ) INFORMATION FOR SEQ ID NO: 402 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Aspergillus flavus 

S~ZZ(3J: ' STRAIN: ATCC 2 6947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 

3 5 CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 100 

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTTCAAT 150 

AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 2 00 

CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 250 

40 GAGGCTGGTA TCTCCAAGGA TGGTC AG AC C CGTGAGCACG CTCTGCTCGC 300 

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 3 50 

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 400 

AACTTCATCA AG AAGGTC GG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 450 

CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 500 

45 CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 550 

AAGACCCTTC TCGAGGC CAT CGATGCCATC GAGCCCCCCG TCCGTCCCAC 600 

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650 

TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGCCTGGT 7 00 

ATGGTCGTTA CTTTCGCTCC TGCCAACGTG ACCACTGAAG TCAAGTCCGT 750 

50 TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 800 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 850 

GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 900 

• CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950 

CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGC TGAGCTC 1000 

55 CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG AC AAGC CCAA 1050 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100 

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150 

GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 12 00 

ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 12 5 0 

60 TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 12 83 

444 
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2 ) INFORMATION FOR SEQ ID NO : 403 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1103 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 

2 0 TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50 

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100 

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150 

GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 200 

CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 250 

25 ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 300 

CAGCTCATCG TCGCCCTCAA CAAGATGGAC AC CTGC AAGT GGTCCGAGGA 350 

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 400 

GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 45 0 

GACAACATGC TTGAGCCCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500 

3 0 GAAGGAGACC AAGGC CGGC A AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550 

TCGACGCCAT TGAGCCCCCT -GTCCGTCCCT CCAACAAGCC CCTCGGTCTT ■ 600 

CCCCTCCAGG "ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650 

CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 7 00 

CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750 

35 CTCCAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 800 

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850 

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900 

CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950 

TGCCCACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 1000 

40 GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050 

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 1100 

CAC 1103 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 404 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1149 bases 
50 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 

445 



CA 02307010 2000-05-19 



AAGCTCAAGT CCGAGCGTGA GCGTGGTATC AC C ATCG AC A TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

5 GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 2 00 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 250 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 3 00 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 3 50 

GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 400 

10 AG AAGGTC GG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 5 00 

GGGATGGGAG AAGG AG AC C A AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTC CAACAAGCCC 600 

CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

15 CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 7 00 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 8 00 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 85 0 

CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

2 0 GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 9 50 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGKCCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

CGAGTCCTTC ACTGACTACC CCCCTCTGGG TCGTTTCGCC GTCCGTGAC 1149 

25 



2 ) INFORMATION FOR SEQ ID NO : 405 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 
_ j B j _ - T yp E - NUc i e i c "acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nlgrer 
40 (B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 

CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

45 GGAAGTTCCA GACTGGCAAG TATGAGGTCA CCGTCATTGG TATGTACTCA 100 

CAGAGTTCTC TTTTCATCAA AGCAATATAC TAACGTCCAT CATAGACGCC 150 

CCCGGTCACC GTGACTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC 200 

TGACTGCGCT ATCCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 250 

GTATCTCCAA GGATGGCCAG ACTCGTGAGC ACGCTCTGCT TGCTTTCACC 300 

50 CTCGGTGTCC GCCAGCTCAT CGTTGCCCTC AACAAGATGG ACACCTGCAA 350 

GTGGTCCGAG G AC C GTT AC A ACGAAATCGT T AAGG AG AC C TCCAACTTCA 400 

. TCAAGAAGGT CGGATACAAC CCCAAGGGTG TTCCTTTCGT CCCCATCTCC 450 

GGTTTCAACG GTGACAACAT GCTCGAGCCC TCCCCCAACT GCCCCTGGTA 5 00 

CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG CAAGGTCACC GGTAAG AC CC 550 

55 TCCTTGAGGC CATCGACGCC ATCGAGCCCC CCGTCCGTCC CTCCAACAAG 6 00 

CCCCTCCGTC TTCCCCTCCA GGATGTC T AC AAGATCTCCG GTATTGGAAC 65 0 

TGTTCCCGTC GGTCGTGTCG AGACCGGTAT CATTGCCCCT GGTATGGTCG 7 00 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAGATG 7 50 

CACCACCAGC AGCTCAAGGA AGGTGTCCCC GGTGACAACG TTGGTTTCAA 800 

60 CGTCAAGAAC GTTTCCGTCA AGGAGGTTCG CCGTGGTAAC GTTGCCGGTG 850 
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10 



20 



50 



ACTCCAAGAA CGACCCCCCT CTTGGCTGTG AGAGCTTCAC CGCCCAGGTC 900 

ATCGTCCTCA ACCACCCCGG TCAGGTCGGC GCTGGTTACG CTCCCGTCCT 950 

GGACTGCCAC ACTGCTCACA TTGCTTGCAA GTTCGCTGAG CTCCTTGAGA 1000 

AGATTGACCG CCGTACCGGA AAGTCTGTTG AATCTTCCCC CAAGTTCATC 1050 

AAGTCCGGTG ACGCTGCCAT CGTCAAGATG ATTCCCTCCA AGCCCATGTG 1100 

TGTTGAGGCT TTCACTGACT ACCCCCCTCT TGGTCGTTTC GCCGTCCGCG 1150 

A 1151 



2 ) INFORMATION FOR SEQ ID NO: 406 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1093 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capltatus 

(B) STRAIN: ATCC 10663 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406 

GCTTAAAGCT GAACGTGAAC GTGGTATCAC CATTGATATC GCTCTCTGGA 50 

AGTTCGAAAC TCCTAAGTAC TACGTTACTG TTATTGATGC TCCAGGTCAC 100 

CGTGATTTCA TCAAGAACAT GATTACTGGT ACTTCCCAAG CCGATTGCGC 150 

3 0 CATTCTTATC ATTGCTGCCG GTGTCGGTGA ATTCGAAGCT GGTATCTCCA 2 00 

AGGAAGGTCA AAGGAGAGAA CAGGCTCTTC TCGCTTTCAC CCTTGGTGTC 250 

AGACAACTTA TCATTGCCAT CAACAAGATG GACTCTGTCA AGTGGGACCA 3 00 

AAAGAGATAC GAAGAAATCG TCAAGGAGGC TTCCAACTTC GTCAAGAAGG 350 

TTGGTTACAA CCCCAAGTCT GTTCCATTCG TTCCTATCTC TGGTTGGAAC 400 

35 GGTGACAACA TGTTGGAACC TACCACCAAC GCCCCATGGT ACAAGGGATG 450 

GACCAAGGAA AC C AAGGCTG GTGCCACTAA GGGTATGACT CTTATTGAAG 5 00 

CCATTGACGC CATTGAACCA CCAGTAAGAC CATCCGACAA GCCACTCCGT 550 

CTCCCACTCC AAGATGTTTA CAAGATTGGT GGTATCGGAA CTGTGCCAGT 600 

CGGCCGTGTC GAAAC C GGT A TCATCAAGGC CGGTATGGTC GTTACCTTTG 65 0 

40 CTCCACCAAT GGTCACAACT GAAGTTAAGT CCGTTGAAAT GCACCACGAA 700 

CAACTTGCTC AAGGTAACCC AGGTGACAAC GTTGGTTTCA ACGTCAAGAA 7 50 

CGTTTCCGTT AAGGAAATCA GACGTGGTAA CGTCTGTGGT GACTCCAAGA 800 

ACGATCCACC AAAGGGCTGC GAATCTTTCA ACGCTCAAGT TATCGTCTTG 850 

AACCACCCTG GTCAAATCTC TGC TGGTTAC TCTCCAGTTC TCGATTGCCA 900 

45 CACTGCCCAC ATTGCCTGCA GATTCGACGA ACTCCTTGAA AAGATCGACC 950 

GTCGTTCTGG TAAGAAGATT GAAGACTCTC CAAAGTTTGT CAAGTCTGGT 1000 

GATGCCGCTA TCGTCAAGAT GATCCCAACC AAGCCAATGT GCGTTGAAAC 1050 

CTTCACTGAA TACCCACCAC TTGGTCGTTT CGCCGTCCGT GAT 1093 



2 ) INFORMATION FOR SEQ ID NO: 407 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 10231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 



CTTGGACAAA 
CTTTGTGGAA 

10 CCAGGTCACA 
TGATTGTGCT 
GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 

15 TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 

2 0 TGTGC C AGTC 
TTACTTTCGG 
CATCACGAAC 
CGTTAAGAAC 
ACTCCAAGAA 

2 5 ATTGTTTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 

30 A 



TTGAAGGCTG 
ATTCGAAACT 
G AG ATTTC AT 
ATTTTGATTA 
GGATGGTCAA 
AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGACGCT 
TGC C ATTGC A 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCCGTTA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAAC TGGT 
ATGCTGC TAT 
TTCACTGACT 



AAAGAGAAAG 
CCAAAATACC 
CAAGAATATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAACCA 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAAC TGGTAT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
CGTCAAGATG 
ACCCACCATT 



AGGTATCACC 
ACGTTACCGT 
ATCACTGGTA 
TACTGGTGAA 
ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTYGT 
TCCACCAACT 
TAAAGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATCAAAGCC 
AAGTC AAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



ATTGATATCG 50 

CATTGATGCT 100 

CTTCTCAAGC 15 0 

TTCGAAGCCG 2 00 

GGCTTACACT 25 0 

ACTCTGTCAA 3 00 

TCCAACTTCG 350 

TCCAATCTCT 40 0 

GTCC ATGGTA 450 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGT AC 600 

GGTATGGTTG 650 

CGTTGAAATG 7 00 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCCC AAGTC 850 

CTCCAGTCTT 900 

TTGGTTGAAA 950 

AAAATTCGTC 1000 

AACCAATGTG 1050 

GC TGTC AG AG 1100 
1101 



2 ) INFORMATION FOR SEQ ID NO: 408 



35 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1089 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 

5 0 GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 50 

TCGAAACTCC AAAATACCAC GTT AC CGTC A TTGATGCTCC AGGTCACAGA 100 

■ GATTTCATCA AGAATATGAT CAC TGGT ACT TCTCAAGCTG ATTGTGCTAT 150 

TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 200 

ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 250 

55 CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 300 

CAGATTTGAA GAAATCATCA AGGAAACCTC CAACTTCGTC AAGAAGGTTG 35 0 

GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 400 

GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 450 

AAAGGAAACC AAATC CGGTA AAGTTACTGG TAAGACCTTG TTAGAAGCTA 500 

6 0 TTGACGCTAT TGAACCACCA ACCAGACCAA CCGACAAACC ATTGAGATTG 550 
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CCATTGCAAG ATGTTTACAA 
TAGAGTTGAA ACTGGTATCA 
CAGCTGGTGT TACCACTGAA 
TTGGCTGAAG GTGTTCCAGG 
5 TTCCGTTAAA GAAATTAGAA 
ATCCACCAAA GGGTTGTGAC 
CATCCAGGTC AAATCTCTGC 
TGCCCACATT GCTTGTAAAT 
GAACTGGTAA GAAATTGGAA 
10 GCTGCTATCG TCAAGATGGT 
CACTGACTAC CCACCATTAG 



GATCGGTGGT ATTGGTACTG 
TCAAAGCCGG TATGGTTGTT 
GTCAAATCCG TTGAAATGCA 
TGACAATGTT GGTTTCAACG 
GAGGTAACGT TTGTGGTGAC 
TCTTTCAATG CCCAAGTCAT 
TGGTTACTCT CCAGTCTTGG 
TCGACACTTT GGTTGAAAAG 
GAAAATCCAA AATTCGTCAA 
CCCAACCAAA CCAATGTGTG 
GTAGATTCGC TGTCAGAGA 



TGCCAGTCGG 600 

ACTTTCGCCC 650 

TCACGAACAA 7 00 

TTAAGAACGT 750 

TCCAAGAACG 800 

TGTTTTGAAC 850 

ATTGTCACAC 900 

ATTGACAGAA 950 

ATCCGGTGAT 1000 

TTGAAGCTTT 1050 
1089 



15 2 ) INFORMATION FOR SEQ ID NO: 409 



20 



25 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 56884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 



CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC 
CTTTGTGGAA ATTC G AAACT CCAAAATACC ACGTTAC CGT 
CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA 
TGATTGTGCT ATTTTGATTA TTGCTGGTGG TAC TGGTGAA 
35 GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT 
TTGGGTGTCA AACAATTGAT TGTTGC TGTC AACAAGATGG 
ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC 
TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT 
GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT 
40 CAAGGGTTGG GAAAAGGAAA CCAAATCCGG T AAAGTT AC T 
TGTTAGAAGC TATTG AC GCT ATTGAACCAC CAACCAGACC 
CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG 
TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC 
TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC 
45 CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG 
CGTTAAGAAC GTTTC CGTTA AAGAAATTAG AAGAGGTAAC 
AC TCC AAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA 
ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT 
GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT 
50 AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC 
AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA 
TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC 
A 



ATTGATATCG 50 

CATTGATGCT 100 

CTTCTCAAGC 150 

TTCGAAGCCG 200 

GGCTTACACT 250 

ACTCTGTCAA 300 

TCCAACTTCG 350 

TCCAATCTCT 400 

GTCCATGGTA 450 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCCCAAGTC 850 

CTCCAGTCTT 900 

TTGGTTGAAA 950 

AAAATTC GTC 1000 

AACCAATGTG 1050 

GCTGTCAGAG 1100 
1101 



55 



2) INFORMATION FOR SEQ ID NO: 410 



60 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1102 bases 
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(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 60193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 



CTTGGACAAA 
CTTTGTGGAA 

15 CCAGGTCACA 
TGATTGTGCT 
GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 

20 TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 

2 5 TGTGCCAGTC 
TTACTTTCGC 
CATCACGAAC 
CGTTAAGAAC 
ACTCCAAGAA 

30 ATTGTTTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 

35 AT 



TTGAAGGCTG 
ATTCGAAACT 
GAGATTTCAT 
ATTTTGATTA 
GGATGGTCAA 
AACAATTGAT 
AAC AG AT TTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGACGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGC TG A 
GTTTCCGTTA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ATGCTGCTAT 
TTCACTGACT 



AAAGAGAAAG 
CCAAAATACC 
CAAGAATATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAACCA 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTAT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
CGTCAAGATG 
ACCCACCATT 



AGGTATCACC 
ACGTTACCGT 
ATC AC TGGTA 
TACTGGTGAA 
ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAAGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATCAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
AC TCTTTC AA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



ATTGATATCG 5 0 

CATTGATGCT 100 

CTTCTCAAGC 150 

TTCGAAGCCG 200 

GGCTTACACT 2 50 

ACTCTGTCAA 3 00 

TCCAACTTCG 35 0 

TCCAATCTCT 400 

GTCCATGGTA 45 0 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTC AA 750 

GTTTGTGGTG 8 qq 

TGCCCAAGTC 850 

CTCCAGTCTT 900 

TTGGTTGAAA 950 

AAAATTCGTC 1000 

AAC C AATGTG 1050 

GCTGTCAGAG 1100 
1102 



40 



2 ) INFORMATION FOR SEQ ID NO: 411 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411 

55 CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATC AC TGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

60 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

450 
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ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

5 TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AAC CG AC AAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 7 00 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

10 CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 9 00 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

15 AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGG TAG ATTC GCTGTCAGAG 1100 

AT 1102 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 412 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1101 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

- (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubllnlensis 

(B) STRAIN: NCPF 3108 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATC G 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCAC TGGTA CTTCTCAAGC 150 

40 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCTAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

45 GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 5 00 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 5 50 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

50 TC AC TTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 7 00 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 900 

55 GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A 1101 

60 
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2 ) INFORMATION FOR SEQ ID NO: 413 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 
15 (B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 

GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

2 0 TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACCT CGCAGGCTGA 15 0 

CTGTGCTATC TTGATCATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 2 00 

TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 25 0 

GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 3 00 

25 GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 3 50 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATC TC TGGC 400 

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 45 0 

GGGCTGGGAG AAGGAGACCA AGGCCGGTAA GTCTACCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 5 50 

30 TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTAC GGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 7 00 

CACGAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 7 50 

CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 800 

3 5 CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 9 50 

TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 1050 

40 TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAG ATTCGC C GTCAGAGA 1098 



2 ) INFORMATION FOR SEQ ID NO: 414 



45 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 ( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 

60 CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

452 



CA 02307010 2000-05-19 



CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATC AC TGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

5 TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTCG AAGAAATCAT CAAGGAAACC TCTAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 5 00 

10 TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AAC CG AC AAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 7 00 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

15 CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

2 0 AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AAC C AATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GC TGTC AG AG 1100 

AT 1102 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 415 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1102 bases 
3 0 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubllnlensis 

(B) STRAIN: CBS 7987 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

45 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

50 GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 65 0 

55 TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 7 00 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 7 50 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900 

60 GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

453 
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AGATTGACAG AAGAACTGGT 
AAATCCGGTG ACGCTGCTAT 
TGTTG AAGC T TTCACTGACT 
AT 



AAGAAATTGG AAGAAAATCC 
YGTCAAGATG GTCCCAACCA 
ACCCACCATT AGGTAGATTC 



AAAATTCGTC 1000 
AACCAATGTG 105 0 

GCTGTCAGAG 1100 
1102 



2 ) INFORMATION FOR SEQ ID NO: 416 

10 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 
20 (B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 

AATTGAAGGC TGAAAGAGAA AGAGGTATCA CCATTGATAT CGCTTTATGG 50 

2 5 AAATTCGAAA CTCCAAAATA CCACGTTACC GTTATTGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGATTGTG 150. 

CTATTTTRAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCT 200 

AAGGATGGTC AAACCAGAGA ACACGCTTTA TTGGCTTACA CCTTAGGTGT 250 

TAGACAATTG ATTGTTGCCG TCAACAAGAT GGACTCTGTT AAATGGGACA 300 

3 0 AGGCTAGATT CGAAGAAATC ATCAAGGAAA CCTCTAACTT CGTCAAGAAG 350 

GTTGGTTACA AC CC TAAG AC TGTTCCTTTC GTYCCAATTT CTGGATGGAA 400 

CGGTGACAAC ATGATTGAAG CCTCCACCAA OTGTCCATGG TACAAGGGTT 450 

GGGAAAAGGA AACCAAGGCT GGTAAATCTA CTGGTAAGAC TTTGTTAGAA 500 

GCCATTGATG CCATTGAACC ACCAACCAGA CCAACCGAAA AGCCATTGAG 550 

3 5 ATTACCATTA CAAGATGTCT ACAAGATCGG TGGTATTGGT ACTGTGCCAG 600 

TCGGTAGAGT TGAAACCGGT GTTATCAAGG GTGGTATGGT TGTTACCTTT 650 

GCCCCAGCCG GTGTCACTAC CGAAGTCAAA TCCGTTGAAA TGCACCACGA 700 

ACAATTAGCT GAAGGTGTTC CAGGTGACAA TGTTGGTTTC AACGTCAAGA 7 50 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTTTGTGG TGACTCCAAG 800 

40 AACGACCCAC CAAAGGGTGC TGAATCTTTC ACCGCTCAAG TTATTGTCTT 850 

GAACCACCCA GGTCARATCT CTGCTGGTTA CTCTCCAGTC TTAGATTGTC 900 

ACACCGCCCA CATTGCTTGT AAATTCGATG CTTTACTCGA AAAGATTGAC 950 

AGAAGATCCG GTAAGAAATT AGAAGACGAA CC AAAATTC G TCAAGTCCGG 1000 

TGATGCTGCT ATCGTCAAGA TGGTCCCAAC CAAACCAATG TGTGTTGAAG 1050 

4 5 CTTTCACTGA ATACCCACCA TTAGGTAGAT TCGCTGTTAG AGAT 1094 



50 



2 ) INFORMATION FOR SEQ ID NO: 417 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Candida glabrata 

454 
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(B) 



STRAIN: ATCC 66032 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 



5 TTTGGACAAG 
CTTTGTGGAA 
CCAGGTCACA 
TGACTGTGCT 
GTATCTCCAA 

10 CTAGGTGTTA 
GTGGGATGAA 
TCAAGAAGGT 
GGTTGGAACG 
CAAGGGTTGG 

15 TGTTGGAAGC 
CCATTGAGAT 
GGTGCCAGTC 
TTACCTTCGC 
CACCACGAAC 

2 0 CGTTAAGAAC 
ACTCCAAGAA 
ATTGTCTTGA 
GGACTGTCAC 
AGAACGACAG 

2 5 AAGTCCGGTG 
TGTCGAAGCT 
A 



TTGAAGGCTG 
GTTCGAAACT 
GAGATTTCAT 
ATCTTGATTA 
GGATGGTCAA 
GACAATTGAT 
TCCAGATTCG 
CGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
CATTGACGCT 
TGCCATTGCA 
GGTAGAGTCG 
CCCAGCTGGT 
AATTG AC TG A 
GTTTCCGTTA 
CGACCCACCA 
ACCACCCAGG 
ACCGCCCACA 
AAGATCCGGT 
ACGCTGCTTT 
TTCTCCGACT 



AAAGAGAAAG 
CCAAAGTACC 
CAAGAACATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTYGCTGTC 
CTGAAATCGT 
CCAAAGACTG 
GATTGAAGCC 
CCAAGGCTGG 
ATCGAACCAC 
AGATGTCTAC 
AAACCGGTGT 
GTTACCACTG 
AGGTTTGCCA 
AGG AAATC AG 
AAGGCTGCTG 
TCAAATCTCT 
TTGC TTGTAA 
AAGAAGTTGG 
GGTTAAGTTC 
ACCCACCATT 



AGGTATCACT 
ACGTY ACCGT 
ATTACTGGTA 
TGTCGGTGAA 
ACGCTCTATT 
AACAAGATGG 
TAAGGAAACC 
TTCCATTCGT 
ACCACCAACG 
TGTCGTCAAG 
CAACCAGACC 
AAGATCGGTG 
CATCAAGCCA 
AAGTCAAGTC 
GGTGACAACG 
AAGAGGTAAT 
CTTCTTTCAA 
GCTGGTTACT 
GTTCGAAGAA 
AAGACTCTCC 
GTTCCATCCA 
GGGTAGATTC 



ATCGATATCG 50 

TATCGATGCY 100 

CTTCTC AAGC 150 

TTCGAAGCYG 200 

GGCTTTCACC 2 50 

ACTCTGTCAA 3 00 

TCCAACTTCA 350 

CCCAATCTCT 4 00 

CTTCCTGGTA 45 0 

GGTAAGACCT 500 

AACTGACAAG 550 

GTATCGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTCTGTGGTG 800 

CGCTACCGTC 850 

CTCCAGTTTT 900 

TTGTTGGAAA 950 

AAAGTTCTTG 1000 

AGCC AATGTG 1050 

GCTGTCAGAG 1100 
1101 



30 



40 



2) -INFORMATION -FOR -SEQ ID NO: 418" 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1080 bases 
35 (B) TYPE: Nucleic acid 

( C ) STRAND EDNE S S : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida gullliermondil 

(B) STRAIN: ATCC 6260 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 

AGAGAAAGAG GTATC AC CAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

AAAGTACCAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100 

AGAACATGAT C AC TGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

50 GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 2 00 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 250 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 300 

GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 3 50 

TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 400 

55 TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 450 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGCCAT 500 

TGAGCCACCT CAAAGACCAA CCGACAAGCC ATTGAGATTG CCATTGCAAG 5 50 

ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600 

ACCGGTATCA TYAAGGCCGG TATGGTTGTT ACCTTTGCCC CAGCTGGTGT 650 

60 YACCACTGAA GTCAAGTCCG TGGAAATGCA C C ACGAACAA TTGGTTGAAG 7 00 

455 
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10 



25 



45 



60 



GTGTTC C AGG TGACAATGTT GGTTTCAACG TTAAGAACGT TTCCGTTAAG 750 

GAAATTAGAA GAGGTAACGT TTGTGG TG AC TCCAAGAACG ACCCACCAAA 8 00 
GGGTTGTGAC TCTTTCACCG CTCAAGTTAT TGTGTTGAAC CACCCTGGTC 850 
AAATCTCTGC TGGTTACTCT CCAGTTTTGG ACTGTCACAC CGCCCACATT 9 00 

GCTTGTAAAT TCGACACCTT GTTGGAGAAG ATTGACAGAA GAACCGGTAA 9 50 

GAAGATGGAG GACAACCCCA AGTTTGTCAA GTCCGGTGAC GCTTCTATCG 1000 

TCAAGATGGT GCCATCCAAG CCAATGTGTG TTGAGGCTTT CACCGACTAC 1050 

CCACCATTGG GAAGATTCGC CGTCAGAGAC 1080 



2 ) INFORMATION FOR SEQ ID NO: 419 



( i ) SEQUENCE CHARACTERISTICS : 
15 % (A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi)ORXGTNAL SOURCE: 

(A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 



TCTGTCAAGT GGGACAAGGC CAGATACGAG GAAATCGTCA AGGAGACCTC 50 

TAACTTCGTC AAGAAGGTTG GTTACAACCC TAAGACTGTT CCATTCGTCC 100 

3 0 CAATCTCTGG TTGGAACGGT GACAACATGA TTGAGGCTTC TACCAACTGT 150 

— G AC TGGTAG A -AGGGTTGGG A— G AAGG AG AC C— AAGTCTGGT.A .AGTCCACCGG 200 

TAAGACCTTG TTGGAGGCCA TTGACGCCAT TGAGCCACCA ACC AG AC C AA 250 

CCGACAAGCC ATTGAGATTG CCATTGCAGG ATGTCTACAA GATTGGTGGT 300 

ATCGGAACTG TGCCAGTCGG CAGAGTTGAG ACCGGTGTTA TCAAGGCCGG 350 

3 5 TATGGTTGTC ACCTTCGCCC CAGCTGGTGT CACCACTGAA GTCAAGTCTG 400 

TCGAGATGCA CCACGAGCAG TTGCCAGAGG GTGTCCCAGG TGACAACGTT 450 

GGTTTCAACG TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT 500 

C TGTGGTG AC TCCAAGCAGG ACCCACCAAA GGGCTGTGAC TCTTTCACCG 550 

CTCAGGTTAT TGTGTTGAAC CACCCAGGTC AGATCTCTTC TGGTTACTCT 600 

40 CCAGTTTTGG ACTGTCACAC TG CC C AC ATT GCTTGTAAGT TCGACACCTT 650 

GGTTGAGAAG ATCGACAGAA GAACCGGTAA GAAGTTGGAA GATGAGCCAA 700 

AGTTCATCAA GTCCGGTGAC GCTGCTATCG TCAAGATGGT CCCAACCAAG 7 50 
C 



2) INFORMATION FOR SEQ ID NO: 420 



( i j SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 

(B) STRAIN: ATCC 16783 



751 



456 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 0 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

5 CCAGGTCACA GAGATTTCAT TAAGAACATG ATT AC TGGT A CTTCTCAAGC 150 

AGATTGTGCT ATTTTGATTA TTGC TGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATTTCCAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 2 50 

TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 3 00 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 3 50 

10 TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTC AG AC C AACTGATAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

15 TGTGCCAGTC GGTAGAGTTG AAACCGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 7 00 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 7 50 

CGTCAAGAAC GTCTCTGTTA AGGATATCAA GAGAGGTAAC GTTTGTGGTG 8 00 

ACTCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

20 ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 9 00 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTC GATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTC C GTTG AAGACCATCC AAAGTC TGTT 1000 

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

25 AT 1102 



2 ) INFORMATION FOR SEQ ID NO: 421 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 

45 TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCACCAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGT AC C AA GTTACCGTTA TCGATGCTCC 100 

AGGTCACAGA GATTTCATCA AGAACATGAT T AC TGGTAC T TCTCAAGCTG 150 

ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 200 

ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 25 0 

50 GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 3 00 

GGGATGAATC TCGTTACCAA GAAATTGTTA AGGAAACCTC CAACTTCATC 350 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 400 

TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT C C ATGGTAC A 450 

AGGGTTGGGA AAAGGAAACC AAGGC TGGTA CCGTCAAGGG TAAGACCTTG 500 

55 TTGGAAGCTA TTGACGCTAT CGAACCACCA AC C AG AC C AA CTGACAAGCC 550 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 600 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 650 

ACCTTCGCCC CAGCCGGTGT C AC T AC CG AA GTTAAGTCCG TCGAAATGCA 700 

CCACGAACAA TTGGAAGAAG GTCTACCAGG TGACAACGTC GGTTTCAACG 750 

60 TCAAGAACGT TTCCGTTAAG GAAATCAGAA GAGGTAACGT CTGTGGTGAC 800 

457 
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TCCAAGAACG ATCCACCAAA GGCTGCTGCT 
CGTCTTGAAC CACCCAGGTC AAATCTCTGC 
ATTGTCACAC TGCTCACATT GCTTGTAAGT 
AACGACAGAA GATCCGGTAA GAAGTTGGAA 
5 GTCTGGTGAC GCTGCTTTGG TTAAGTTCGT 
TTGAAGCATT CTCTGACTAC CCACCATTGG 



TCTTTCAACG CCACTGTTAT 85 0 

TGGTTACTCT CCAGTTTTGG 900 

TCGACGAATT GTTGGAAAAG 95 0 

GACTCTCCAA AGTTCTTGAA 1000 

TCCATCTAAG CCAATGTGTG 1050 

GTAGATTCGC TGTCAGAGA 1099 



10 2 ) INFORMATION FOR SEQ ID NO: 422 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 



AAGTTAAAGG CAGAAAGAGA AAGAGGTATC ACTATTGATA TTGCTTTATG 5 0 

GAAGTTYGAA ACTCCAAART ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACCG GTACTTCTCA AGCTGATTGT 15 0 

GCTATTTTGA TTATTGCTGG TGGTGTCGGT GAATTCGAAG CTGGTATCTC 200 

30 C AAGG ATGGT CAAACTAGAG AACACGCTCT ATTGGC TTTC ACCTTAGGTG 250 

• TTAG AC AA TT - GATTGTTGCT "ATCXACAAGA "TGGATTCCGT TAARTGGGAT 300 

GAAAACAGAT TTGAAGAAAT TGTCAAGGAA ACCCAAAACT TCATCAAGAA 3 50 

GGTTGGTTAC AACCCAAAGA CTGTTC C ATT CGTTCCAATY TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCATCCACCA ACTGTCCATG GTACAAGGGT 450 

35 TGGACTAAGG AAACCAAGGC AGGTGTTGTT AAGGGTAAGA C C TT ATT AG A 50 0 

AGCAATCGAT GCTATTGAAC CACCTGTCAG ACCAACCGAA AAGCCATTAA 550 

GATTACCATT AC AAGATG TT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

GTCGGTAGAG TCGAAACCGG TGTCATTAAG CCAGGTATGG TTGTCACTTT 650 

TGCTCCAGCA GGTGTCACCA CCGAAGTCAA GTCCGTTGAA ATGCACCATG 70 0 

40 AACAATTAGA ACAAGGTGTT C C AGGTG AT A AC GTTGGTTT CAACGTTAAG 750 

AACGTTTCTG TCAAGGATAT CAAGAGAGGT AACGTTTGTG GTGACTCCAA 800 

GAACGACCCA CCAATGGGTG CAGCTTCCTT CAATGCTCAA GTCATTGTCT 850 

TGAACCACCC TGGTCAAATT TCCGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCCC ACATTGCATG TAAGTTCGAC GAATTAATCG AAAAGATTGA 950 

45 CAGAAGAACT GGTAAGTCTG TTGAAGACCA TCCAAAGTCY GTCAAGTCTG 1000 

GTGATGCAGC TATCGTCAAG ATGGTCCCAA CCAAGCCAAT GTGTGTTGAA 1050 

GC TTTC AC TG AATAYCCACC ATTAGGTAGA TTCGCAGTCA GAGAT 1095 



50 



60 



2) INFORMATION FOR SEQ ID NO: 423 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1104 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

458 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida Iambi ca 

(B) STRAIN: ATCC 247 50 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 

CTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

10 AGATTGTGCT ATTTTRATYA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 200 

GTATCTCTAA GGATGGTCAA ACCAGAGAAC ACGCTCTTCT TGCATTCACT 2 50 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 3 00 

GTGGGACGAA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 400 

15 GGTTGGAACG GTG AC AAC AT GATTGAACCA TCTGCTAACT GTCCATGGTA 450 

CAAGGGATGG AC T AAGG AAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 500 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

AAGGCTTTGA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600 

TACTGTGCCA GTC GGT AG AG TTGAAAC CGG TATCATCAAG CCAGGTATGA 650 

20 TTGTCGTTTT CGCTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATGC AC CATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 7 50 

CAACGTCAAG AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 800 

GTGACTCCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850 

GTCATTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

2 5 TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTACTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGC T A CTGAAGACCA TCCAAAGTCT 1000 

GTCAAGTCTG GTG ATGC AGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT 1050 

GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100 
GAGA 



30 



40 



2) INFORMATION FOR SEQ ID NO: 424 

35 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



1104 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 
45 (B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 

GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATC AC CATC GATATCGCTT 50 

50 TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150 

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 200 

TCTCTAAGGA CGGTCAAACC AG AGAGC AC G CTTTGTTGGC TTACACCTTG 250 

GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 3 00 

5 5 GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 3 50 

AGAAGGTTGG TTACAACCCT AAG AC TGTTC CATTCGTCCC AATCTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGG AG AC C A AGTCYGGTAA GTCCACCGGT AAGAC CTTGT 500 

TGGAGGC CAT TGACGCCATT GAGCCACCTT CGAGACCAAC CGACAAGCCA 550 

60 TTGAGATTGC CATTGCAAGA TGTCTACAAG ATYGGTGGTA TTGGTACTGT 600 

459 
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GCCAGTCGGT AGAGTTGAGA CCGGTGTCAT 
CCTTTGCTCC AGCTGGTGTC ACCACTGAAG 
CACGAACAAT TGGCTGAGGG TGTCCCAGGT 
CAAGAACGTT TCCGTCAAGG AAATCAGAAG 
5 CCAAGAACGA CCCACCAAAG GCTGCTGCTT 
GTCTTGAACC ACCCAGGTCA AATCTCCTCY 
CTGTCACACT GCYCACATTG CTTGTAAGTT 
TCGACAGAAG AACYGGTAAG AAGTTGGAAG 
TCYGGTGACG CTGCTATCGT CAAGATGGTC 
10 YGAAGCTTTC ACCGACTACC CACCATTGGG 



CAAGGCCGGT ATGGTTGTCA 650 

TCAAGTCCGT GGAAATGCAC 700 

GACAACGTTG GTTTCAACGT 750 

AGGTAACGTC TGTGGTGACT 800 

CYTTCACTGC TCAAGTYATY 850 

GGTTACTCTC CAGTYTTGGA 900 

CGACACCTTG ATTGAGAAGA 950 

AAGAGCCAAA GTTCATCAAG 1000 

CCAACCAAGC CAATGTGYGT 1050 

TAGATTCGCT GTCAGAGA 1098 



2 ) INFORMATION FOR SEQ ID NO : 425 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

( xi > SEQUENCE DESCRIPTION: SEQ ID NO: 425 

™ TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA TATTCGAAACTZeCA^TACC^CGTTACCGT -TATTG ATGC T 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTAC TGGT A CTTCCCAAGC 15 0 

TGATTGTGCT ATCTTAATTA TTGC TGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTTTATT AGCATTCACC 250 

35 TTAGGTGTTA AGCAATTAAT TGTTGCTATC AACAAGATGG ACTCTGTTAA 300 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 400 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTACTAACT GTCCATGGTA 45 0 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTTAAG GGTAAGACCT 500 

40 TATTAGAAGC TATTG ATGC T ATTGAACCAC CTGTCAGACC AACTGACAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTGT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAACA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

45 CGTCAAGAAC GTCTCTGTTA AGGATATTAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCAGGTTACT CTCCAGTTTT 900 

AGATTGTCAC ACTGCCCACA TTGCATGTAA GTTCGATGAA TTAATCGAAA 95 0 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

50 AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 
A 



2 ) INFORMATION FOR SEQ ID NO: 42 6 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1095 bases 
60 (B) TYPE: Nucleic acid 

460 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 

CAAATTGAAG GC TGAAAG AG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

g£S?£ga AACTCCAAAA TACCATGTTA CTGTTATTGA TGCTCCAGGT 100 

SaS^gagatt TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

15 TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATC T 200 

SSggSgg TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 250 

GTTAAGCAAT TGATTGTTGC CATCAACAAG ATGGACTCAG TCAAATGGGA 3 00 

CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 3 50 

AGGTTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 400 

20 AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 500 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 600 

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

25 TTGCCCCAGC TGGTGTTACC AC TG AAGTC A AGTCCGTTGA AATGC AC C AC 7 00 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 7 50 

GAAC GTTTC A GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 8 00 

AGAACGATCC AC CAAAGGG A TGTGAYTCCT TC AATGC TC A AGTTATTGTC 850 

TTGAACCACC CAGGTCAAAT CTCTGCTGGT TACTCACCAG TCTTGGATTG 900 

3 0 TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

~ ACAGAAGAAe-CGGTAAGAAA -TTGGAAGATG AACCAAAATT CATCAAGTCC . 1000 

GGTGATGCTG CYATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA 1050 

AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 1095 



35 



50 



2 ) INFORMATION FOR SEQ ID NO: 427 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 



CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

55 CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 200 

GTAAGACTCT TCTCGAGGCC ATTGACGCCA TCGAGCCCCC CTCCCGTCCT 250 

ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 300 

TATCGGTACG GTACCTGTCG GCCGTGTCGA GACCGGTATC ATCAAGCCCG 3 50 

60 GCATGGTCGT CACTTTCGCC CCCGCTGGTG TC AC C AC TG A AGTG AAGTC C 400 

461 
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GTCGAGATGC ACCACGAGCA 
CGGTTTCAAC GTCAAGAACG 
TCTGCGGTGA CTCCAAGAAC 
GCCCAGGTCA TCGTTTTCAA 
5 CCCCGTTTTG GACTGCCACA 
TCCAGTCCAA GATGGACCGT 
AAGTTCATCA AGGCTGGTGA 
GC 



GATCCCCGAG GGTCTCCCCG 
TTACCGTCAA GGATATCCGC 
GACCCCCCCA AGGGCTGCTC 
CCACCCCGGT CAGATCTCCA 
CCGCCCACAT TGCCTGCCGC 
CGTACTGGTA AGACCCTTGA 
CTCCGCTATC GTCAAGATGG 



GTG AC AACGT 45 0 

CGTGGTAACG 5 00 

TTCCTTCACT 550 

ACGGTTACTC 600 

TTCGACGAGA 650 

GGAGAACCCC 7 00 

TTCCCTCCAA 7 50 
7 52 



2 ) INFORMATION FOR SEQ ID NO: 428 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1093 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNE S S : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaenca 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 
^G?^S£ CTCCAAAGTA CCAAGTTACC G^ATCGATG CTCCAGGTCA 



50 
100 

30 SSSSSK S£XSX£ tgattactgg tact^tcaa GCTGACTGTG 150 
30 SSSStSS tattgctggt ggtgtcggtg aattcgaagc cggtatctcc 200 



SSSSSSSa aaaccagaga acacgc^tg ttggctttca ccttgggtgt 250 

tagacaattg attgttgctg ttaacaagat ggattccgtt AAGTGGGATG 300 

AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 3 50 

15 GTCGGTTACA ACCCAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 400 

cggtcacaSc ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 450 

SggaSagga AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 500 

g??aotgacg ctatcgaacc accatccaga CCAACTGACA AGCCATTGAG 550 

aStcccattg caagatgtct ACAAGATTGG TGGTATCGGA ACTGTGCCAG 600 

40 TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTAC CTTT 650 

TCCCciGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 700 

GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 800 

AACGATCCAC CAAAGGCTGC TGCTTCTTTC AAC GC C AC TG TTATCGTCTT 850 

45 GAACCATCCA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ScACTGCTCA CATTGCTTGT AAGTTCGACG AATTGTTGGA AAAGAACGAT 950 

AGAAGATCCG GTAAGAAGTT GGAAGACTCT CCAAAGTTCT TGAAGTCCGG 1000 

TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 1050 

CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 109 i 



50 



2 ) INFORMATION FOR SEQ ID NO: 429 

55 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNE SS : Double 

(D) TOPOLOGY: Linear 

60 

462 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 
5 (B) STRAIN: ATCC 13 803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 

AAATTGAAGG CTGAAAGAGA AAGAGGTATC ACCATTGATA TCGCTTTGTG 50 

10 GAAATTCGAA ACTCCAAAAT AC C ACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

GCTATTTTGA TTATTGC TGG TGGTACTGGT G AATTC GAAG CTGGTATTTC 200 

TAAAGATGGT CAAACCAGAG AACACGCTTT GTTGGCTTAC ACCTTGGGTG 2 50 

TCAAACAATT GATTGTTGCT GTCAACAAGA TGGACTCTGT TAAATGGGAC 3 00 

15 AAAAACAGAT TTGAAGAAAT TATCAAGGAA ACTTCTAACT TCGTCAAGAA 350 

GGTTGGTTAC AACCCTAAGG CTGTTCCATT CGTTCCAATC TCTGGTTGGA 400 

ATGGTGACAA CATGATTGAA GCTTCTACCA ACTGTCCATG GTACAAGGGT 450 

TGGGAAAAAG AAACCAAGGC TGGTAAGGTT ACCGGTAAGA CTTTGTTGGA 5 00 

AGCCATTGAT GCTATTGAAC CACCTTCAAG AC C AAC TG AC AAGCCATTGA 550 

2 0 GATTGCCATT GCAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 600 

GTC GGTAGAG TTGAAAC TGG TGTCATCAAA GCCGGTATGG TTGTT AC TTT 650 

CGCCCCAGCT GGTGTTAC C A CTGAAGTCAA ATCCGTCGAA ATGCACCACG 700 

AACAATTGGC TGAAGGTGTC CCAGGTGACA ATGTTGGTTT CAACGTTAAG 7 50 

AAC GTTTCTG TTAAAGAAAT TAGAAGAGGT AACGTTTGTG GTGACTCCAA 800 

2 5 GAACGATCCA CCAAAGGGTT GTGACTCTTT CAACGCTCAA GTTATTGTCT 850 

TGAACCACCC AGGTCAAATC TCTGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCTC ATATTGCTTG TAAATTCGAC ACCTTGGTTG AAAAGATTGA 950 

CAGAAGAACT GGTAAGAAAT TGGAAGAAAA TCCAAAATTC GTCAAATCCG 1000 

GTGATGCTGC TATTGTCAAG ATGGTTCCAA CCAAACCAAT GTGTGTTGAA 1050 

30 GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTCA GAGA 1094 
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2 ) INFORMATION FOR SEQ ID NO: 430 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 

50 CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT C AC T ATCG AC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCAAAG TACCACGTTA CTGTCATTGA TGCCCCAGGT 100 

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 15 0 

TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 200 

CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 2 50 

55 GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 3 00 

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 350 

AGGTTGGTTA C AACC CAAAG ACTGTTCCAT TTGTCCCAAT TTCYGGTTGG 400 

AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 450 

TTGGGAGAAG GAGACCAAGG CTGGTGTTGT CAAGGGTAAG ACCTTGCTCG 500 

60 ATGCCATTGA CGCCATTGAG CCACCAACAA GACCAACTGA CAAGCCATTG 55 0 

463 
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AG ATTGC C AC TCCAGGATGT CTACAAGATT 
AGTCGGTAGA GTCGAGACCG GTGTCATCAA 
TTGCCCCATC CGGTGTCACC ACTGAGGTTA 
GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC 
5 GAACGTCTCT GTTAAGGAGA TCAGAAGAGG 
AGAACGACCC ACCACAGGGT GCTGAGTCCT 
TTGAACCACC CAGGTCAGAT CTCTGCTGGT 
TCACACCGCC CACATTGCTT GTAAGTTCTC 
ACAGAAGATC CGGTAAGTCC CTTGAGGCCT 
10 GGTG ATGC C G CTATCGTCAA GATGGTTCCA 
GGCCTTCACT GACTACCCAC CACTCGGTAG 



GGTGGTATCG GAACTGTTCC 600 

GCCAGGTATG GTTGTTACCT 650 

AGTCCGTCGA GATGCACCAC 700 

AACGTTGGTT TCAACGTTAA 7 50 

TAACGTTGCC GGTGACTCCA 800 

TCAACGCTCA GGTCATTGTC 85 0 

TACTCTCCAG TTTTGGACTG 900 

TGAGCTTTTG GAGAAGATTG 95 0 

CTCCAAAGTT CGTCAAGTCT 1000 

TCCAAGCCAT TGTGTGTTGA 1050 

ATTCGCTGTC AGAGA 1095 



15 2 ) INFORMATION FOR SEQ ID NO : 431 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 

30 



GCTGAAAGAG AAAG AGGT AT CACCATCGAT ATCGCTTTGT GGAAATTCGA ~ 5 0 

AACTCCAAAR TACCACGTTA CCGTCATTGA YGCTCCAGGT CACAGAGATT 100 

TCATCAAGAA CATGATYACT GGTACTTCTC AAGCTGATTG TGCTATYTTG 150 

ATTATCGCTG GTGGTACTGG TGAATTCGAA GCTGGTATYT CTAAGGATGG 2 00 

35 TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 2 50 

TGATTGTTGC TGTCAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 3 00 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 3 50 

CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 400 

ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 450 

40 GAAACCAAGG CTGGTAAGGT TACCGGTAAG AC TTTGTTGG AAGCCATTGA 500 

CGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG AG ATTGC CAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 600 

GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 7 00 

45 CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 75 0 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 800 

ACCAAAGGGT TGTGASTCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 85 0 

CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 900 

CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

50 CGGTAAGAAG TTGGAAGAAA AC C C AAAGTT TGTCAAGTCC GGTGACGCTG 1000 

CTATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 



2 ) INFORMATION FOR SEQ ID NO: 432 

( i ) SEQUENCE CHARACTERI STICS : 
(A) LENGTH: 1072 bases 
60 (B) TYPE: Nucleic acid 

464 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 

AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 50 

AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG 100 

ATCACTGGTA CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

15 TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 200 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGC TGTC 250 

AACAAGATGG ACTCCGTCAA GTGGGAC AAG AACAGATTCG AGGAGATTGT 300 

C AAGG AG AC C TCCAACTTCG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 350 

TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 400 

2 0 TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 450 

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 500 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 550 

AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGACCGGCAT 600 

CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

2 5 AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 7 00 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGATCAG 7 50 

AAGAGGTAAC GTTTGCGGTG ACTCCAAGAA CGACCCCCCC AAGGCTGCTG 800 

CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 850 

GCTGGTTACT CTCCGGTTTT GGATTGCCAC ACTGCCCACA TTGCTTGCAG 900 

3 0 ATTCGACCAG TTGATTGAGA AGATCGACAG AAGAACCGGT AAGAAGATGG 950 

AGGACGACCC TAAGTTCATC AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 1000 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 105 0 

GGGTC GTTTC GC TGTC AG AG AC 107 2 



35 
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2 ) INFORMATION FOR SEQ ID NO: 433 



( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 7 51 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitls 

(B) STRAIN: Silveira 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 



AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA AGGAAGTCTC 50 

CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT CCATTCGTCC 100 

55 CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC CACCAACGCT 150 

C CTTGGTAC A AGGGCTGGAA CAAGGAGACC GCCTCTGGCA AGC AC AC TGG 200 

CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA ACCCGCCCCA 250 

CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA GATCTCTGGT 300 

ATCGGAACAG TCCCAGTCGG CCGTGTCGAA ACCGGTGTTA TCAAGCCTGG 350 

60 TATGGTTGTG ACCTTCGCTC CTTCCAACGT C AC C AC TG AA GTCAAGTCCG 400 

465 
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TCGAAATGCA CCACCAGCAG CTCACCCAGG 
GGCTTCAACG TCAAGAACGT CTCTGTCAAG 
CGCTGGTGAC TCCAAGAACG ACCCACCAAA 
CCCAGGTCAT CGTCCTCAAC CACCCTGGTC 
5 CCAGTCCTTG ACTGCCACAC TGCCCACATT 
CCTCGAGAAG ATCGACCGCC GTACCGGTAA 
AGTTCATCAA GTCTGGTGAT GCCGCTATCG 



GTAACCCTGG TGACAACGTT 450 

GAAGTCCGCC GCGGTAACGT 500 

GGGCTGCGAC TCCTTCAACG 550 

AAGTCGGTGC TGGTTATGCC 600 

GCTTGCAAGT TCTCCGAGCT 650 

ATCCGTTGAG AACAACCCCA 7 00 

TCAAGATGGT TCCATCCAAG 750 

751 



2 ) INFORMATION FOR SEQ ID NO: 434 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1146 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNESS : Doubl e 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Cryptococcus albldus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 



AAGCTCAAGG C C GAGCG AG A GCGAGGTATC ACCATCGACA TCGCCTTGTG 50 

G AAGTTC GAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC GCCCCCGGTC 100 

3 0 ACCGAGACTT CATCAAGAAC ATGATC AC C G GTACCTCGCA GGCCGACTGT 150 

GCCATCCTCA~TCATCGCCTC-CGGTATCGGA--GAGTTCGAGG CTGGTATCTC 2 00 

CAAGGACGGT ~C AG AC CCGAG AGCACGCCCT TTTGGCCTTC ACCCTCGGTG "250 

TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG CAAGGTTAGT 3 00 

TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT TTACAGTGGT 350 

3 5 CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG TTTCATCAAG 400 

AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA TCTCCGGATG 450 

GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC TGGTACAAGG 500 

GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA GACCTTGCTC 550 

GAGGCCATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG ACAAGCCCTT 600 

40 GCGATTGCCC CTCCAGGATG TCTACAAGAT CGGTGGTATC GGTACGGTGC 650 

CCGTCGGTCG AGTCGAGACC GGTGTCATCA AGGCTGGTAT GGTCGTCACC 7 00 

TTCGCCCCTG CCAACGTCAC CACCGAAGTC AAGTCCGTCG AAATGCACCA 750 

CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT TTCAACGTCA 800 

AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG CTCCGACTCG 850 

45 AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC AGGTCATTGT 900 

~~ CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC GTTTTGG AC T 950 

GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT TGAGAAGATC 1000 

GACCGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT TCGTCAAGTC 1050 

CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC ATGTGTGTCG 1100 

50 AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT CCGAG A 1146 



55 



2) INFORMATION FOR SEQ ID NO : 435 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doub 1 e 
60 (D) TOPOLOGY: Linear 

466 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Exophiala jeanselmex 

(B) STRAIN: ATCC 64755 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 

10 CAAGCTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCTTGT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 100 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 150 

CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATTCGAA GCCGGTATCT 2 00 

CCAAGGATGG TC AG AC TCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 25 0 

15 GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 3 00 

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA 350 

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 400 

AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 45 0 

CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

20 AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 55 0 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 600 

CGTCGGTCGT - GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 650 

TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTC CGTCG A AATGCACCAC 7 00 

GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 750 

25 GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 800 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGTCAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 900 

CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950 

ACCGCCGTAC CGGTAAATCC ATCGAAAACA ACCCCAAGTT CATCAAGTCT 1000 

30 GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 

GGCCTTCACT GACTACCCAC CTCTTGGTCG. TTTCGCCGTC CGTGA 1095 



35 2 ) INFORMATION FOR SEQ ID NO: 436 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE:. 

(A) ORGANISM: Fusari um oxysporum 

(B) STRAIN: WSA-212 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 



AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 100 

TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 150 

CGGTCACCGT GATTTCATCA AGAACATGAT CACTGGTACT TCCCAGGCCG 200 

55 ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAG C AC GCTCTTCTTG CCTACACCCT 300 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC AC C AC C AAGT 3 50 

GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC CTCTTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGGCTGTC GCTTTCGTCC CCATCTCCGG 450 

60 TTTCAACGGT GACAACATGC TTACCCCCTC CACCAACTGC CCCTGGTACA 500 
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GQA gcgtgaGATC AAGTCCGGCA AGCTCACTGG CAAGACCCTC 550 

™SocS ttgactccat CGAGCCCCCC AAGCGTCCCG TTGACAAGCC 600 

CCTTCGTCTT CCCCTTCAGG ATGTCTACAA GATCGGTGGT ATTGGAACGG 650 

ScCGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG TATGGTCGTT 700 

c ACCTTCGCTC CTTCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACGAGCAA CTCACTGAGG GCCAGCCCGG TGACAACGTT GGTTTCAACG 800 

SSaSScot CTCCGTCAAG GACATCCGAC GTGGTAACGT CGCTGGTGAC 850 

TCCAAGAACG ACCCCCCTAT GGGTGCCGCT TCTTTCACCG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGTC AGGTCGGTGC TGGTT AC GCT CCCGTCCTCG 950 

10 ACTGTCACAC TGCCCACATT GCCTGCAAGT TCGCCGAGAT CCAGGAGAAG 1000 

ATCGACCGCC GAACCGGTAA GGCTACTGAG GCCGCCCCCA AGTTCATCAA 1050 

GTCTGGTGAC ?CCGCCATCG TCAAGATGGT TCCCTCCAAG CCCATGTGTG 1100 
TTGAGGCTTT CAC 

15 

2 ) INFORMATION FOR SEQ ID NO: 437 

( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 72 6 bases 

(B) TYPE: Nucleic acid^ 

(C) STRANDEDNE'S'S : Doiaisle 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geotrichum spp. 

(B) STRAIN: LEV- 4 



30 



;,:i(,xi,):sEQUENCEi^scRjpjr^^ 



GGTCCGAGGA CAGATTCAAC GAGATTGTCA AGGAGACTTC CAACTTCATC 5 0 

AAGAAGGTTG GTTAYAACCC CAAGACTGTT GC TTTCGTCC CCATCTCTGG 100 

35 TTGGAACGGT GACAACATGA TTGAGCCCTC CACCAACTGC CCCTGGTACA 15 0 

AGGGATGGCA G AAGG AG AC C AAGGCTGGTG TCACTAAGGG TAAGACCCTC 200 

CTTGAGGCCA TCGATGCCAT TGAGCCCCCT GTCAGACCTT CCGACAAGCC 250 

CCTCCGTCTT CCCCTCCAGG ATGTCTACAA GATCGGTGGT ATCGGAACTG 300 

TGCCCGTCGG CCGTGTCGAA ACCGGTGTCA TCAAGGCCGG TATGGTCGTC 350 

40 ACCTTCGCCC CCGCTGGTGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 400 

CCACGAGCTC CTCACTGAGG GTCTCCCCGG TGACAACGTT GGTTTCAACG 450 

TCAAGAACGT CTCCGTTAAG GATATCAGAC GTGGTAACGT CTGCGGTGAC 500 

TCCAAGAACG ATCCCCCCAA GGCTTGCGCT TCTTTCAACG CCCAGGTCAT 550 

TATCTTCAAC CACCCTGGTC AGATCTCTGC TGGATACTCT CCCGTCCTTG 600 

45 ATTGCCACAC CGCCCATATT GCTTGCAAGT TCGACACTTT. GATCGAGAAG 65 0 

ATTGACCGTC GTAC TGGTAA GAAGACTGAG GACTCCCCCA AGTTCGTCAA 700 

GGCCGGTGAT GCTGCTATCG TCAAGA 725 



50 

2 ) INFORMATION FOR SEQ ID NO: 438 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 754 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G186A5 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 438 

CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATCATC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 100 

CCAATCTCTG GTTTCGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 150 

10 CACATGGTAC AAGGGCTGGA ACAAGGAGAC TGCCTCTGGC AAGTCTTCTG 2 00 

GTAAAACCCT TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 2 50 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 3 00 

TATTGGC AC T GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 350 

GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTC AAGTC C 400 

15 GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 450 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 500 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

20 TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 7 50 

GCCC 754 



25 



35 



2 ) INFORMATION FOR SEQ ID NO : 439 



{ i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 743 bases 
30 (B) TYPE : Nuc 1 elc„acid 
( C ) — STRANDEDNESS : -Double- 
CD) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Issatchenkia orientalls 

(B) STRAIN: ATCC 6258 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 

CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG 100 

GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TCCATGGTAC 150 

45 AAGGGTTGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 200 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA AC C G AAAAGC 2 50 

CATTAAGATT AC C ATT AC AA GATGTTTACA AGATTGGTGG TATTGGT AC T 300 

GTGCCAGTCG G TAG AGTCG A AACCGGTGTC ATTAAGCCAG GTATGGTTGT 3 50 

CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 400 

50 ACCATGAACA ATTAGAACAA GGTGTTCCAG GTGATAACGT TGGTTTCAAC 450 

" GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 5 00 

CTCCAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 600 

GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATCGAAAA 650 

55 GATTGACAGA AG AAC TGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 7 00 

AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 7 43 



60 2 ) INFORMATION FOR SEQ ID NO: 440 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1091 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 

CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 
GGAAGTTCGA GACCCCTAAG TACCACGTTA CCGTCATTGA CGCTCCTGGT 
CACCGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 
CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 
CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 
GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 
GGAGGACCGC TTCAACGAGA TTGTCCGC G A AGTGTCGAAC TTCATCAAGA 
AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 
CACGGTGACA AC ATGATC GA GGCCACCACC AACATGCCTT GGTACAAGGG 
CTGGGAGAAG GAGACCAAGT CGGGCAAGGT C AC TGGTAAG ACTCTGCTGG 
ACGCCATCGA CGCCATCGAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 
CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 
TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 
TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTC GGTTG A GATGCACCAC 
GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 
GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 
AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 
ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 
CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 
ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 
GGTGACGCTG CCATGGTGGA G ATGATC CC C ACCAAGCCCA TGTGTGTGGA 
GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 



2) INFORMATION FOR SEQ ID NO: 441 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 749 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

AC C ACCAAGT AC TCGG AGG A CCGCTTCAAC GAGATTATTC GCGAGACTTC 
GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 
CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC CACCAACATG 
CCGTGGTACA AGGGCTGGGA G AAGG AG AC C AAGTCGGGCA AGGCCACTGG 
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TAAGACCCTT CTGGACGCTA TTG ACGC CAT 
CCGACAAGCC TCTCCGTCTT CCTCTCCAGG 
ATCGGTACYG TCCCGGTCGG CCGTGTTGAG 
TATGGTTGTG ACCTTCGCTC CGTCGAACGT 
5 TCGAGATGCA CCACGAGCAG ATCCCTGAGG 
GGTTTCAACG TGAAGAACGT GTCGGTCAAG 
CGCCTCGGAC TCGAAGAACG ACCCGGCTCA 
CTCAGGTCAT TGTGATGAAC CACCCTGGTC 
CCRGTGC TCG ACTGCCACAC TGCTCACATT 
10 CCTCCAGAAG ATCGACCGTC GTTCGGGTAA 
AGTTCATCAA GTCGGGTGAC GCTGCCATGG 



TGAGCCGCCG ACGCGCCCGA 2 50 

ATGTGTACAA GATCGGTGGT 300 

AC C GGTGTT A TCAAGCCCGG 3 50 

CACSACTGAA GTTAAGTCGG 4 00 

GTCTTCCGGG TGACAACGTT 450 

G AC ATTCGC C GTGGTAACGT 500 

GGAGGCTGCC TCGTTCAATG 550 

AGATCAGCAA CGGTTACTCG 600 

GCCTGCCGCT TCAACAACAT 650 

GGTTCTYGAA GAGAACCCCA 7 00 

TTGAGATGAT CCCGACCAA 749 



15 2 ) INFORMATION FOR SEQ ID NO: 442 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 



30 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 



ACTG-AAGGCC-GAGGGTGAGC GTGGTATGAC -GATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

TTCGACCTTT TGCCTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 15 0 

TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 200 

35 ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 2 50 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 3 00 

CGGTGTTAGG CAGCTCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 350 

TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 450 

40 TTTCGAGGGT GACAACATGA TCGAGGCCTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA C AAGGAG AC C GCCAGTGGCA AGAGCACCGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGCCAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

CCTYCGTCTC CCTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 65 0 

TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 7 00 

45 ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

TCCAAGAACG ACCCCGCCAA GGG CTGCG AC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

50 ATTGC C AC AC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTACCGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 1050 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

TTGAGGC TTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150 



2 ) INFORMATION FOR SEQ ID NO: 443 



( i ) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 1099 bases 
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10 



45 



<B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 



GGACAAGTTG AAGGCTGAGA GAGAGAGAGG T ATC AC CATC GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

15 GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 200 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 250 

GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 3 00 

GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

20 AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 400 

TGGAACGGTG ACAACATGAT TGAGGCYTCC AC T AACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 500 

TGGAGGC C AT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

TTGAGATTGC CTTTGCAGGA TGTCTACAAG ATCGGTGGTA TCGGAACGGT 600 

2 5 GCCAGTCGGC CGTGTCGAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 7 00 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 750 

CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 850 

3 0 GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG . CCTGTAARTT CGACACCTTG TTGGAGAAGA 950 

TTGACAGAAG AAC TGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 1000 

TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 1050 

TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 

35 

2 ) INFORMATION FOR SEQ ID NO: 444 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
50 (B) STRAIN: ATCC 42570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

55 GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 100 

CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 

CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 2 00 

GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 2 50 

TGGTATCTCC AAGGATGGCC AGACCCGTGA GCACGCTCTG CTCGCCTACA 3 00 

60 CCCTCGGTGT TAAGCAGCTC ATCGTCGCTA TCAACAAGAT GGACACCACC 350 
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AAGTGGTCTG AGGCCCGTTT CCAGGAGATC 
CATCAAGAAG GTCGGCTACA ACCCCAAGAC 
CTGGTTTCCA CGGCGACAAC ATGCTTTCCC 
T AC AAGGGC T GGGAGAAGGA GACCAAGGCT 
5 CCTCCTTGAG GCCATCGACT CCATCGAGCC 
AGCCCCTCCG CCTTCCCCTT CAGGATGTGT 
ACAGTCCCTG TCGGCCGTAT CG AG AC TGGT 
CGTGACCTTC GCTCCTTCCA ACGTCACCAC 
TGCACCACGA GCAGCTCTCC GAGGGTGTCC 

10 AACGTCAAGA ACGTCTCCGT CAAGGAGATC 
TGACTCCAAG AACGACCCCC CTCTGGGTGC 
TCATCGTCCT CAACCACCCC GGCCAGGTCG 
CTCGACTGCC ACACCGCCCA CATTGCCTGC 
GAAGATCGAC CGCCGTACCG GCAAGTCTGT 

15 TCAAGTCTGG CGACTCTGCC ATCGTCAAGA 
TGCGTTGAGG CTTTCACCGA CTACCCTCCT 
TGA 



ATC AAGGAG A CCTCCAACTT 400 

CGTCGCTTTC GTCCCCATCT 450 

CCTCCACCAA CTGCCCCTGG 500 

GGCAAGTCCA CCGGCAAGAC 550 

CCCCAAGCGC CCCAGCGACA 600 

ACAAGATCGG CGGTATCGGC 650 

GTCATCAAGC CCGGCATGGT 7 00 

CGAAGTCAAG TCCGTTGAGA 7 50 

CCGGTGACAA CGTCGGCTTC 800 

CGTCGTGGCA ACGTCGCCGG 850 

CGCTTCTTTC GATGCCCAGG 900 

GTGCTGGCTA CGCCCCCGTC 950 

AAGTTCGCCG AGATCAAGGA 1000 

CGAGTCCGCC CCCAAGTTCA 1050 

TGATTCCCTC CAAGCCCATG 1100 

CTGGGCCGCT TCGCCGTCCG 1150 

1153 
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2 ) INFORMATION FOR SEQ ID NO : 445 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 763 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32071 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

40 CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 2 00 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 3 00 

TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 400 

45 GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA C TCTAAG AAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

50 TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 7 00 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 7 50 

GCCCATGTGC GTC 763 



2 ) INFORMATION FOR SEQ ID NO: 446 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1346 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 
{ D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penlcillium marneffei 

(B) STRAIN: ATCC 64101 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 5 0 

GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 100 

AC CGTG ATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 150 

15 GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 2 00 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 2 50 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 3 00 

CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TAC TAATGC A 3 50 

CCATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 400 

2 0 AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 45 0 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 5 00 

GTTGGGAGAA QGAGACCAAG GCGGG^APiGG TCACTGGTAA GACCCTCcTC 550 

GAGGCCATCG ACGCCATTGA GCCCCCTACC CGTCCCGCCA ACAAGGTTAG 600 

TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

2 5 TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 700 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 750 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 800 

TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 850 

CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 900 

3 0 ACAACGTTGG TTTCAACGTG AAGAACGTCT CCGTCAAGGA AATCCGTCGT 95 0 

GGTAACGTTG - CTGGTGAC AG l^AAGAACGAC jCCCCCTGCCG "GTGCTGCCTC 1000 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100 

GCTGAGCTCC TCGAGAAGAT TGACCGTCGT ACCGGAAAGT CTGTTGAGGA 1150 

35 CCACCCCAAG TTCATCAAGT CCGGTGACGC TGC CATC GTC AAGATGATTC 1200 

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 12 50 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTC CATCC 13 00 

TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 134 6 



40 



55 



2 ) INFORMATION FOR SEQ ID NO: 447 



( i ) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 



AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 

GAAATTCGAA ACTCCAAAAT ACCATGTTAC CGTTATTGAT GCTCCAGGTC 100 

60 ACAGAGATTT CATCAAAAAT ATGATTACTG GTACTTCCCA AGCTGATTGT 150 

474 



CA 02307010 2000-05-19 



GCTATYTTAA 
AAAAGATGGT 
TTAAACAATT 
GAATCTCGTT 
5 AGTTGGTTAC 
ATGGTGATAA 
TGGAAAAAAG 
AGCCATTGAT 
GTTTACCATT 

10 GTCGGTAGAG 
TGCCCCAGCT 
AACAATTGAC 
AATGTTTCTG 
AAAC GATCC A 

15 TAAACCATCC 
CACACTGCTC 
CAGACGTACT 
GTGATGCTGC 
GCTTTCACTG 

20 



TTATTGCCGG 
CAAACCAGAG 
GATTGTTGCT 
TCGAAGAAAT 
AACCCAAAAA 
CATGATTGAA 
AAACCAAAGC 
GCTATTGACC 
ACAAGATGTT 
TTGAAACCGG 
GGTGTT AC C A 
TGAAGGTTTA 
TTAAAGAAAT 
CCAAAAGCTG 
AGGTCAAATC 
ACATTGCTTG 
GGTAAGAAAT 
TATTGTTAAA 
ACTACCCACC 



TGGTATTGGT 
AACACGCTTT 
ATCAACAAGA 
TGTCAAGGAA 
CTGTTCCATT 
CCATCAWCTA 
TGGTGAAGCT 
CACCATCAAG 
TACAARATTG 
TGTTATCAAA 
CTGAAGTCAA 
CCAGGTGACA 
CCGTCGTGGT 
CTGAATCATT 
TCTGCTGGTT 
TAAATTTGAC 
TAGAAGAAGC 
TTTGTTCCAT 
ATTAGGTCGT 



GAATTCGAAG 
NTTAGCTTAC 
TGGATTCCGT 
ACYTCAAACT 
CGTTCCAATC 
ACTGTCCATG 
AAAGGTAAAA 
ACCAACTGAT 
GTGGTATTGG 
CCAGGTATGG 
ATCTGTTGAA 
ATGTTGGTTT 
AACGTCTGTG 
CAATGCTCAA 
ACTCTCCAGT 
ACTTTAATTG 
TCCAAAATTC 
CAAAACCATT 
TTCGCTGTCA 



CTGGTATCTC 2 00 

ACCTTAGGTG 2 50 

TAAATGGGAT 300 

TTATCAAGAA 3 50 

TCAGGTTGGA 400 

GTACAAAGGT 450 

CTTTATTAGA 500 

AAACCATTAC 550 

TACTGTGCCA 600 

TTGTTACCTT 650 

ATGCATCATG 7 00 

CAACGTTAAG 7 50 

GTGACTCTAA 800 

GTTATTGTCT 850 

TTTAGATTGT 900 

AAAAAATTGA 950 

ATC AAATC AG 1000 

ATCAGTTGAA 1050 

GAGA 1094 



2 ) INFORMATION FOR SEQ ID NO: 448 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

30 

- -(-i-i ) MOLECULE TYPE:. Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 
35 (B) STRAIN: ATCC 2149 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 

CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 5 0 

4 0 CTTTATGGAA ATTCGAAACT C C AAAATAC C ATGTT AC CGT TATTGATGCT 100 

CCAGGTGACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGCCGGTGG TATTGGTGAA TTCGAAGCTG 200 

GTATCTCAAA AGATGGTCAA ACCAGAGAAC AC GCTTTRTT AGCTTACACC 250 

TTAGGTGTTA AACAATTGAT TGTTGC TATC AACAAGATGG ATTCCGTTAA 3 00 

45 ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 3 50 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 400 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 45 0 

AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

5 0 CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 600, 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAACCAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTC AAATC T GTTGAAATGC 7 00 

ATCATGAACA ATTGACTGAA GGTTTAC C AG GTGACAATGT TGGTTTCAAC 750 

GTTAAGAATG TTTC TGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

55 CTCTAAAAAC G ATC C AC C AA AAGCTGCTGA ATCATTCAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGTCACA CTGCTCACAT TGC TTGT AAA TTTGACACTT TAATTGAAAA 95 0 

AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 

AATCAGGTGA TGCTGCTATT GTTAAATTTG TTCCATCAAA ACCATTATCA 1050 

60 GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA ' 1100 
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2 ) INFORMATION FOR SEQ ID NO: 449 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1201 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Pseudallescheria boydix 

(B) STRAIN: ATCC 44331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 

2 0 CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 5 0 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTCTTTG 100 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GGACAATTCC ASCCCTGSAT xoo 

AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 200 

CCCGGCCATC GTGATTTCAT CAAGAACATG ATT AC TGGTA CCTCCCAGGC 25 0 

25 TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 300 

GCATCTCCAA GGATGGCCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 350 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG ACACCAACAA 400 

CTGGTCCGAG GACCGATACA AGGAGATCAT CAAGGAGACC TCCAACTTCA 45 0 

TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

3 0 GGTTTCCACG GAGACAACAT GCTTAC CCCC TCCACCAACT GCCCCTGGTA 55 0 

^GAAGGGTTGG— GTCGGTGAGG TCAAGGGTAA CACCCTTACC _GGC AAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGATCGGTG GTATTGGCAC 7 00 

TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGG TCG 75 0 

35 TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 800 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 85 0 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

ACTCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

40 TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGACCG CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 1200 
A 



1201 



45 



2) INFORMATION FOR SEQ ID NO: 450 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
SO (B) STRAIN: ATCC 56015 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 

CAAGCTTAAG GCTGAACGTG AACGTGGTAT C AC CATC GAT ATTGCTCTCT 50 

5 GGAAGTTCGA AACCCCCAAG TACCAAATTA CCGTTATTGA TGCTCCCGGT 100 

CACCGTGATT TCATCAAGAA CATGATTACC GGTACTTCTC AAGCCGATTG 150 

TGC TATTCTT ATCATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 2 00 

CCAAGGATGG TCAAACCCGT GAACACGCCC TTTTGGCTTT CACTCTCGGT 2 50 

GTCCGTCAAT TGATTGTTGC TGTCAACAAG ATGGATACCA CCAAGTGGTC 3 00 

10 CGAAGCTCGT TTCAACGAAA TYGTCAAGGA AGTTTCTTCC TTCATCAAGA 35 0 

^ AGATTGGTTA CAACCCCAAG TCTGTTCCCT TCGTCCCCAT CTCTGGTTGG 400 

CACGGTGACA ACATGTTGGA AGAATCTACC AACATGCCCT GGTACAAGGG 450 

ATGGAACAAG GAAACCAAGG CTGGTGCCAA GTCTGGTAAG ACCCTCTTGG 500 

ATGCCATTGA CAACATTGAC CCTCCTACCC GTCCTGTTGA CAAGCCTCTC 550 

15 CGTCTTCCTC TTCAAGATGT TTACAAGATT GGTGGTATCG GTACTGTCCC 600 

CGTCGGTCGT GTC GAAACTG GTGTCATCAA GGCTGGTATG GTTGTC AC C T 650 

TCGCTCCTGC TGCTGTCACC ACTGAAGTTA AGTCCGTCGA AATGCACCAC 700 

GAAACCCTCA CTGAAGGTCT CCCCGGTGAC AACGTCGGTT TCAACGTCAA 7 50 

GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT TCTGACTCCA 800 

2 0 AGAACGACCC CGCCAAGGAA GCCGGTTCCT TCACCGCTCA AGTCATTATC 850 

TTGAACCACC CTGGTCAAAT TGGTGCTGGT TACGCTCCYG TTTTGGATTG 900 

TCACACTGCT CACATTGCCT GTAAGTTCGC TGAATTGATC GAAAAGATTG 950 

ACAGACGTTC CGGTAAGTCC TTGGAAGCTA CTCCCAAGTT CGTCAAGTCT 1000 

GGTGACTCTG CCATCGTCAA GATGATCCCC TCCAAGCCCA TGTGTGTTGA 1050 

2 5 AGCTTACACT GACTACCCTC CTCTCGGTCG TTTCGCTGTT CGTGA 1095 



30 



2 ) INFORMATION FOR SEQ ID NO: 451 



(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1092 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 



45 GCTGAAGGCC GAGCGAGAGC GTGGTATCAC CATCGATATC GCTCTATGGA 50 

AGTTCGAGAC CCCCAAGTAC AACGTCACCG TCATTGACGC TCCAGGACAT 100 

CGTGATTTCA TCAAGAGTGA GTTAAC CAT A ACATCAAACA GTGTTGCAAA 150 

CATCAGCTAA TGCATGTTAT GCGTCCAGAC ATGATTACTG GTACTTCCCA 200 

GGCCGATTGC GCTATTCTCA TCATCGCCAC CGGTGTTGGT GAGTTCGAGG 250 

50 CTGGTATCTC CAAGGATGGC CAGACCCGAG AGCACGCCCT TCTCGCCTTC 3 00 

ACCCTCGGTG TCAGACAGCT CATCGTTGCC TTGAACAAGA TGGACTCGGT 350 

CAAGGTAGGC TAACTTCACA ACGTCGGCTT CCCATCATTC ATTC AC TTAC 400 

CTGTCTTGTC TTCCACCCTC CAGTTCTCCG AGTCCCGATA CGATGAAATC 450 

GTCAAGGAGA CATCCGGTTT CATCAAGAAG GTC GGATTC G ACCCCAAGGG 500 

55 TGTTCCCTTC GTCCCCATCT CAGGATGGCA CGGAGACAAC ATGATCGAGG 550 

AGTCCACCAA CATGCCTTGG TACAAGGGAT GGAAGAAGAC CACCAAGACC 60 0 

GGCGAGTACA AGGGAAAGAC CCTGCTCGAG GCCATCGACT CCATCGAGCC 65 0 

CCCCACCCGT CCTACCGACA AGCCTCTCCG ACTTCCCCTC CAGGATGTCT 7 00 

ACAAGATTGG TGGTATCGGA ACAGTGCCAG TCGGACGAGT CGAGACTGGT 7 50 

60 ACCATCAAGG CTGGTATGGT CGTCACCTTC GCTC CTTCAG CTGTCACCAC 800 
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CGAAGTCAAG TCTGTTGAGA TGCACCACGA 
CAGGTGACAA CGTCGGATTC AACATCAAGA 
CGAAGAGGAA ACGTCTGCGG TGACTCCAAG 
TGCTTCCTTC ACCGCCCAGG TCATTGTCCT 
5 GTAACGGATA CTCTCCAGTT CTCGATTGCC 
CATATTAGTT TGAACTCTTT TGAACAATAC 
TTCAGCCCAC ATTGCATGCA AGTTCGACAC 
GACGATCCGG AAAGTCCATC GAAGATACCC 
GACGCCGCCA TCGTCAAGAT GGTCCCCACC 
10 TTTCACCGAC TACCCACCTC TTGGACGATT 



GCAGCTCGAG GCTGGTCTTC 850 

ACGTTTCAGT CAAGGATATC 900 

AACGATCCCC CCAAGGAGGC 950 

CAACCACCCC GGTCAAATCG 1000 

ACACTGGTGA GTCATTCTTC 1050 

TAACGTGAAT CATTATACTT 1100 

CCTCCTAGAG AAGATTGACC 115 0 

CCAAGTTCGT CAAGTCTGGT 1200 

AAGCCAATGT GCGTTGAGGC 125 0 

CGCCGTCCGT GA 12 92 



2) INFORMATION FOR SEQ ID NO: 452 

15 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1289 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic SNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Sporobolomyces salmomcolor 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 

3 0 AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTTCGAG ACCCCC AAGG .^TGCGTTCTC A CCCCGGCTGA GGAGCACGCA 100 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 200 

GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTTCGAGG 250 

3 5 CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 300 

ACCCTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 350 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 400 

TCATCAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 450 

TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

40 GTACAAGGGA TGGAAGAAGG AG AC C AAGGC CGGTGAGGTC AAGGGCAAGA 55 0 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 7 00 

GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA 7 50 

45 TCAAGGCCGG TATGGTC GTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

TTC CGTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 1000 

50 ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 1050 

^ CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGCAAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 12 00 

GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 1250 

55 CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 1289 



2 ) INFORMATION FOR SEQ ID NO : 453 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 



(C) STRANDEDNESS : Double 



5 



(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Sporothrix schenckil 

(B) STRAIN: WSA-148 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

15 GTGAGCGCGG TATC AC CATC GATATTGCTC TGTGGAAGTT CGAGACCCCC 50 

AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG ATTTCATCAA 100 

GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT CTCATCATTG 150 

CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACT 2 00 

CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC AGCTGATCGT 2 50 

2 0 CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT CGTTACCAGG 3 00 

AGATCATCAA GGAGACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 350 

AAG AC TGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG ACAACATGCT 400 

TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGCTGGGAG AAGGAGGGCA 450 

AGAGCGGCAA GGTTACCGGT AAG AC TCTGC TGGACGCCAT TGACGCCGTC 500 

25 GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC CCCTCCAGGA 550 

TGTCTACAAG ATCGGCGGTA TCGGCACTGT CCCTGTCGGC CGTATCGAGA 600 

CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC GTCCAACGTC 650 

ACCACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC TTGTTGAGGG 7 00 

TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC TCCGTCAAGG 750 

30 AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA CCCCCCCTCG 800 

GGCGCGGCC A- -CCTTCAACGC-CC AGGTCATT GTCCTGAACC ACCCCGGCCA 850 

GGTCGGCAAC GGCTACGCCC CGGTTC TGG A CTGCCACACC GCCCACATTG 900 

CCTGCAAGTT CACCGAGATC CTTGAGAAGA TCGACCGCCG TACCGGCAAG 950 

TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTG AC G CCGCCATTGT 1000 

35 CAAGCTGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC ACTGACTACC 1050 

CCCCTCTGGG CCGTTTCGCC 107 0 



40 2 ) INFORMATION FOR SEQ ID NO: 454 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1092 bases 

(B) TYPE: Nucleic acid 



45 



(C) STRANDEDNESS: Double 



( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus clferrli 

(B) STRAIN: ATCC 52550 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 



55 



50 
100 
150 
200 
250 



60 



CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 
ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 
GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 
ATCCTTATCA TTGC TGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA 
GGACGGTCAG AC CAG AGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 
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GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 300 

gatcgtttca ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 350 

?GG??ACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 400 

GTGACAATAT GATTGAGCCA TCCACCAACT GCCCATGGTA CAAGGGTTGG 450 

5 GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 

PATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC "U 

5tcc?cttca GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

ggtcgSgttg AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 650 

rorAGCTGGT GTCACCACTG AAGTCAAGTC TGTCGAGATG CACCACGAAC 700 

io agatccSSa AGGTACCCCA GGTGACAACG TTGGTTTCAA CGTCAAGAAC 750 

10 Stctccctca AGGAAATCAG ACGTGGTAAC GTTACTGGTG actccaagaa 800 

CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTCA 850 

Scaccctgg TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 

ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 950 

15 ACGTTCCGGT AAGAAGGTCG AAGACTCTCC TAAGTTCGTC AAGGC CGGTG 1000 

^gSgccat TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 1092 



20 



30 
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2 ) INFORMATION FOR SEQ ID NO: 455 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1149 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagropnytes 

(B) STRAIN: WSA-225 



3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 

GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAATGTCA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

40 TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 200 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 300 

TGTCAAGCAG CTCATCGTTG CCATCAACAA GATGGACACC AC CAAC TGGT 350 

CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTC AC C AA CTTCATCAAG 400 

45 AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

CTCGAGGCCA TCGATGC CAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

50 TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 850 

TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

55 CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 950 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTACCGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

TTGAGGCTTT CACTGACTAC CCCCCACTTG GTCGTTTCGC CGTCCGTGA 1149 
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2 ) INFORMATION FOR SEQ ID NO : 456 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

( C ) STRAND EDNES S : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 
15 (B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 

TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 

2 0 CTCTCTGGAA GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 100 

CCAGGTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

CGACTGCGCC ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 200 

GTATCTCCAA GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTTCACC 2 50 

CTTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 3 00 

25 GTGGGACCAA AAGAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 400 

GGTTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 4 50 

CAAGGGATGG AC C AAGG AAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 5 00 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG 55 0 

30 CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 

TGTGCCAGTC GGCCGTGTCG AAACCGGTAT CATCAAGGCC GGTATGGTCG 650 

TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 7 00 

CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 7 50 

CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 800 

35 AC TCC AAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 85 0 

ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 

CGATTGCCAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 1000 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGCCAATGTG 1050 

40 CGTTGAAACC TTCACTGAAT ACCCACCACT TGGTCGTTTC GCCGTCCGTG 1100 

A 1101 



45 2 ) INFORMATION FOR SEQ ID NO: 457 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dexmatitidls 

(B) STRAIN: WSA-229 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 

481 
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GTTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 100 

CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 15 0 

CATCTTGATC ATTGCCGCCG GTACCGGTGA ATTCGAAGCC GGTATCTCCA 2 00 

5 AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 250 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 3 00 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 4 50 

10 GGAGAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTCGAGG 5 00 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATCGGAA CGGTTCCTGT 6 00 

CGGTCGTGTC GAGACCGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 7 00 

15 CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 95 0 

2 0 GTCGTACCGG AAAGTC CATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 108 5 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 458 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 492 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDED NESS: Doubl e 

(-D-) TOPOLOGY-: --Li-near- — 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL-95 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 

TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 50 

TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACTCGTGAG 150 

45 GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 200 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 250 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 300 

CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTCCG 3 50 

TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 400 

50 CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 492 



55 2 ) INFORMATION FOR SEQ ID NO: 459 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

482 
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10 



35 



45 



(D) TOPOLOGY: Linear 
{ i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capita tus 

(B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 



GTCCGTGGTC AAGAAGTTAT TG AC AC TGGT GCCCCAATTA CCATTCCTGT 50 

TGGTCGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 2 00 

15 TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 250 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 3 00 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 350 

CGGTGAAAGA ACCCGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 400 

CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 450 

2 0 CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 500 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 600 

CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 65 0 

CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 7 00 

2 5 TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT G AC TGATCC T 7 50 

GCCCCAGCCA CCACTTTCGC TCACTTGGAC GCCACCACTG TCTTGTC TCG 800 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 

AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 9 00 

GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TCCAAGATAT 950 

3 0 CATTGCCATT TTGGGTATGG ATGAATTGTC TGAAGCTGAT AAGCTTACTG 1000 

TCG AACGTGC CCGTAAGATC CAAAGATTCC _TTTCCCAACC ATTCGCTGTT 1050 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 1100 

CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTAC GAT CACCTTCCAG 1150 

AAGC 1154 



2 ) INFORMATION FOR SEQ ID NO: 460 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 
50 (B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 

CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 5 0 

55 AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 100 

AACACCGTCA G AGC TATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 15 0 

TACCGAAGTC AACGATACCG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 2 50 

GGTCCAATTG AATGTAAGGA AAAGAAACCA ATTCACGCTG AACCACCATC 3 00 

60 ATTC GTTGAA CAATCCACTG CTGCCGARAT TTTGGAAACC GGTATCAAGG 3 50 

483 
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TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAARAT TGGTTTATTC 400 

GGTGGTGCTG GTGTCGGTAA GACCGTCTTT ATCCAAGAAT TGATTAACAA 450 

CATTGCCAAA GCCCATGGTG GTTTCTCTGT CTTTACCGGT GTYGGTGAAA 500 

Saaccagaga AGGTAACGAT TTGTACCGTG AAATGAAAGA AACCGGTGTC 550 

5 ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTCTTCG GTCAAATGAA 600 

CGAACCACCA GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CTGAATACTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTCATTGAT 700 

AACATTTTCA GATTCACCCA AGCTGGTTCC GAAGTGTCTG CTTTGTTAGG 750 

TCGTATTCCA TCTGCTGTCG GTTATCAACC AACCTTAGCC ACTGATATGG 800 

10 GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAAGGTTC CGTCACCTCT 850 

10 otc?a!gc?g TCTATGTCCC AGCTGATGAT TTGACCGATC CTGCTCCAGC 900 

CACTACATTC GCCCATTTGG ATGC CACTAC TGTCTTGTCT AGAGGTATTT 950 

CTGAATTGGG TATCTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 1000 

TTATTGGACG CTTCTGTTGT TGGTCAAGAA CATTACGATG TCGCTACTGG 1050 

15 TGTTCAACAA ACTTTACAAG CTTACAAATC CTTACAAGAT ATCATTGCTA 1100 

TTTTGGGTAT GGATGAATTG TCTGAAGCTG ATAAATTGAC TGTCGAAAGA 1150 

GCCCGTAAGA TCCAAAGATT CTTGTCTCAA CCATTCGCTG TTGCTGAAGT 12 00 

TTTCACTGGT ATCCCAGGTA GATTAGTCAG ATTGCAAGAC ACTGTCAAAT 1250 

CATTCAAGGA TGTTTTGGAA GGTAAATACG ATAACTTGCC AGAAA 1295 



20 



2 ) INFORMATION FOR SEQ ID NO : 461 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1277 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(.iTXMOLECULE""TYPE : Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
35 (B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 

TAACTTGCCA GCTATTTTGA ATGC TTTG AC TTTGAAGAAC GGTGACCAAG 50 

40 ATTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC CGTCAGAGCT 100 

ATTGCTATGG ATGGTACTGA AGGTTTAGTC AGAGGTACTG AAGTCAACGA 150 

TACCGGTGCC CCAATCTCCG TTCCAGTCGG TAGAGGTACC TTAGGTAGAA 200 

TCATCAATGT TGTTGGTGAA CCAATTGATG ACAGAGGTCC AATTGAATGT 250 

AAGGAAAAGA AACCAATTCA TGCAGAACCA CCATCCTTCG TTGAGCAATC 300 

45 CACTGCTGCC GAAATTTTGG AAACCGGTAT CAAGGTTGTC GACTTATTGG 350 

CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG TGCTGGTGTC 400 

GGTAAGACCG TCTTTATCCA AGAATTGATT AACAACATTG CTAAAGCCCA 450 

TGGTGGTTTC TCCGTCTTTA CCGGTGTCGG TGAAAGAACC AGAGAAGGTA 500 

ACGATTTGTA CCGTGAAATG AAAGAAACCG GTGTCATCAA CTTAGAAGGT 550 

50 GACTCCAAGG TCGCTTTGGT CTTTGGACAA ATGAACGAAC CACCAGGTGC 600 

TAGAGCTAGA GTTGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCAGAG 650 

ATGAAGAAGG TCAAGATGTC TTGTTGTTCA TCGATAACAT TTTCAGATTC 700 

ACCCAAGCTG GTTCCGAAGT GTCTGCTTTG TTAGGTCGTA TTC CATC TGC 75 0 

CGTCGGTTAT CAACCAACCT TAGCTACTGA TATGGGTCTT TTGCAAGAAC 800 

55 GTATTACCAC CACCAAGAAA GGTTCCGTCA CCTCTGTCCA AGCTGTCTAT 850 

GTCCCAGCTG ATGATTTGAC CGATCCTGCT CCAGCCACCA CATTCGCCCA 900 

TTTGGATGCC ACTACTGTCT TGTCTAGAGG TATTTCTGAA TTGGGTATTT 950 

ACCCAGCTGT CGATCCATTG GATTCCAAAT C C AG ATT ATT GGACGCTGCC 1000 

GTTGTTGGTC AAGAACATTA TGATGTCGCT ACTGGTGTTC AACAAACTTT 1050 

60 GCAAGCTTAC AAATC C TTAC AAGATATCAT TGCTATTTTG GGTATGGATG 1100 
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AATTGTCTGA AGC TG AT AAA 
AGATTCTTGT CTCAACCATT 
AGGTAGATTA GTCAGATTGC 
TGGAAGGTAA ATACGATCAC 



TTGACTGTCG AAAGAGCCCG 
CGCCGTTGCT GAAGTTTTCA 
AAGAC AC TGT CAAATCATTC 
TTGCCAG 



TAAGATTCAA 1150 
CTGGTATTCC 1200 
AAGGATGTTT 12 50 
1277 



2 ) INFORMATION FOR SEQ ID NO: 462 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 
20 (B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 

GTAACTTGCC AGCTATTTTG AACGC TTTGA CCTTGAAGAA CGGTGAAAAC 50 

25 GAC TTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 

TATTGCTATG G ATGGT AC TG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 

ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 200 

ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 250 

CAAGGAAACC AGACCAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 3 00 

3 0 CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 3 50 

GCCCCTTACG ~CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGCCGGTGT 4 00 

CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 450 

ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 

AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 55 0 

35 TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 

C TAG AGC TAG AGTTGCTTTA ACCGGTTTAA CCATTGCCGA ATACTTCAGA 650 

GACGAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 7 00 

CACCCAAGCC GGTTC CGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 7 50 

CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 800 

40 AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 85 0 

CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 

ACTTGGATGC TACC AC TGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

TACCCAGCTG TCGATCCATT GG ATTC C AAA TCCAGATTGT TAGATGCTGA 1000 

TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

45 TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTC TG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC AC CGGTATC C 1200 

CAGGTAGATT AGTTAGATTG C AAGAC AC TG TTAAATCTTT CAAGGAAGTC 1250 

TTAGAAGGTA AATATGATCA CTTACCAG 1278 

50 



2 ) INFORMATION FOR SEQ ID NO: 463 

55 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

60 

485 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 
5 (B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 

TCTGGTCAGA GGCGAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 50 

10 CTGTCGGCAG AG AG ACC C TG GGCAGAATCA TCAACGTTAT CGGTGAACCT 100 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 200 

CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 2 50 

ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 3 00 

15 ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTTCACAG 3 50 

*~ GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 400 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 45 0 

CGGCCAAATG AACGAACCAC CAGGAGCCAG AGCCAGAGTC GCCTTGACCG 500 

GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 55 0 

20 CTGTTCGTCG AC AAC AT TTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

rrACCGATAT GGGTCTGTTG CAAGAAAGAA TTACGACGAC AAAGAAGGGT / w 

TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 7 50 

TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

2 5 CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTGCTAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 9 00 

CGTCGCCACA AAGGTCCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 1000 

ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 1050 

3 0 TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 1100 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 1150 

CCAG 1154 



35 



45 



2 ) INFORMATION FOR SEQ ID NO : 464 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1293 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilllermondll 

(B) STRAIN: ATCC 62 60 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 

C C ACT AC GAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 5 0 

AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

55 TGCC AGCGTC ACTGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 2 00 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 250 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 300 

GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCTCCA TACGCCAGAG GTGGTAAGAT TGGATTGTTC 400 

60 GGTGGTGCCG GTGTCGGTAA GACTGTGTTT ATCCAAGAGT TGATTAACAA 450 
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CATTGCCAAG GCTCACGGTG GTTTCTCCGT 
G AAC C AG AG A AGGTAACGAT TTGTACCGTG 
ATCAACTTGG AAGGTGAATC CAAGGTGGCC 
CGAACCTCCA GG AGC TAG AG CCAGAGTTGC 
5 CTGAATACTT CAGAGATGAG GAGGGTCAAG 
AACATTTTCA GATTCACTCA AGCTGGTTCT 
TCGTATTCCT TCGGCTGTCG GTTACCAACC 
GTTTGTTGCA AG AGC GT ATT ACCACCACCA 
GTCCAAGCTG TCTATGTGCC AGCCGATGAT 

10 TACTACTTTT GCTCACTTGG ATGCTACCAC 
CCGAGTTGGG TATTTACCCA GCTGTCGATC 
TTGTTGGATG CCTCTGTTGT CGGTGAGGAG 
CGTTCAACAA ACCTTGCAAG CTTACAAGTC 
TTTTGGGTAT GGACGAATTG TCGGAAGCTG 

15 GCCCGTAAGA TCGAGAGATT CTTGTCTCAA 
TTTCACTGGT ATCAGTGGTA AGTTGGTCAG 
CTTTCAAGGA AGTCTTGGAA GGTAAGTACG 



GTTCACCGGT GTCGGTGAAA 500 

AAATGAAGGA AACTGGTGTC 550 

TTGGTGTTCG GTCAAATGAA 600 

CCTTACCGGT TTG AC CATC G 650 

ATGTGTTGTT GTTCGTCGAC 7 00 

GAAGTGTCGG CTTTGTTGGG 7 50 

TACTTTGGCC AC CG ATATGG 800 

AGAAGGGTTC CGTCACCTCT 850 

TTGACCGATC CTGCTCCTGC 900 

TGTGTTGTCT AGAGGTATCT 950 

CTTTGGATTC CAAGTCGAGA 1000 

CACTACTCGG TTGCTTCTAA 1050 

TTTGCAAGAT ATCATTGCCA 1100 

ACAAGTTGAC CGTCGAGAGA 1150 

CCATTTGCTG TTGCCGAAGT '1200 

ATTGGAGGAC ACTATCAGAT 1250 

ATCACTTGCC AGA 1293 



20 



2 ) INFORMATION FOR SEQ ID NO : 465 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1293 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 

CACTTTGACG ATGGTAACTT GCCAGCCATT TTCAACGCCT TGAAGTTGAA 50 

GAACGGTGAC C AGGAC TTGG TCTTGGAGGT CGCCCAGCAC TTGGGTGAGA 100 

ACACCGTCAG AACCATTGCC ATGGACGGTA CCGATGGTTT GGTCAGAGGC 150 

40 GAGGCTGTCA CTGACACTGG TGCTCCAATC TCCGTGCCTG TTGGTC GTG A 2 00 

GACTTTGGGT CGTATCATCA ACGTTATTGG TGAGCCAATT GAC GAG AG AG 250 

GACCAATCAA GTCCAAGAAG AGAAACCCAA TCCACACTGA CCCACCAACC 300 

TTC GTTGAGC AGTC TACTTC TGCTGAGGTT TTGGAGACTG GTATTAAGGT 3 50 

TGTCGACTTG TTGGCCCCTT ACGCCAGAGG TGGTAAGATT GGTTTGTTCG 400 

45 GTGGTGCCGG TGTCGGTAAG ACCGTCTTTA TCCAAGAGTT GATTAACAAC 450 

ATTGCCAAGG CCCACGGTGG TTTCTCTGTC TTTACCGGTG TCGGTGAGAG 500 

AAC C AG AG AA GGTAACGATT TGTACCGTGA AATGCAGGAG ACCGGTGTCA 550 

TCAACTTCGA GGGTGACTCC AAGGTCGCCT TGGTCTTCGG TCAGATGAAC 600 

GAGCCACCAG GAGC TAG AGC TAGAGTTGCT TTGACCGGTT TGACCATTGC 650 

50 CGAATACTTC AGAGATGAAG AAGGTCAGGA TGTGTTGTTG TTCGTTGACA 700 

ACATTTTCAG ATTCACTCAG GCTGGTTCCG AGGTGTCCGC CTTGTTGGGT 750 

C GT ATTC CAT CTGCTGTCGG TTACCAGCCA ACCTTGGCCA CCGATATGGG 800 

TACCTTGCAA GAAAGAATTA CCACCACCAA GAAGGGTTCC GTCACCTCTG 85 0 

TCCAGGCCGT TTACGTGCCA GCTGATGATT TGACCGATCC TGCCCCAGCT 9 00 

55 ACCACTTTCG CTCACTTGGA TGCTACCACT GTGTTGTCTA GAGGTATTTC 950 

CGAGTTGGGT ATCTACCCAG CTGTCGACCC ATTGGACTCC AAGTCTAGAT 100 0 

TGTTGGATGC CTCTGTTGTC GGTAAGGAGC ACTACGATGT TGCTACCAAC 1050 

GTCCAGCAGA CCTTGCAGGC CTACAAGTCC TTGCAGGATA TCATTGCCAT 1100 

TTTGGGTATG GATGAATTGT CCGAAGCCGA CAAGTTGACT GTCGAGAGAG 1150 

60 C T AG AAAG AT TCAGAGATTC TTGTCCCAGC CATTCGCCGT CGCCGAGGTT 12 00 
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TTCACTGGTA TCGAGGGTAA GTTGGTCAGA TTGGAGGAGA CCGTCAAGTC 12 50 

TTTC AAGG AG GTCTTGGATG GTAAGTACGA CCACTTGCCA GAG 12 93 



2 ) INFORMATION FOR SEQ ID NO : 466 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1111 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 

(B) STRAIN: ATCC 16783 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 

AACSGTSGAT CTAAGTTAST CTTAGAAGTT GCTCAACATT T^fTGAAAA 50 

CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100 

AACCAGTTAA TG AT AC TGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 15 0 

25 ACTTTAGGTA GAATCTTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 2 00 

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TCATCAAGAT CCTCCTGCTT 250 

TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 300 

GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 350 

TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 400 

3 0 TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TC AC TGGTGT CGGTGAAAGA 450 

ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA GTGGTGTTAT ■ . 500 

"'taacttagaa GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 550 

AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 65 0 

35 CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 700 

GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 750 

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 800 

CCAAGCAGTT TATGTC CC AG CAGATGATTT AACTGATCCT GCTCCAGCAA 850 

CTACTTTCGC CCACTTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 900 

40 GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCTAGATT 950 

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 1000 

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050 

TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 1100 

AAGAAAGATT C 1111 

45 

2 ) INFORMATION FOR SEQ ID NO : 467 

50 (i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 12 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 
60 (B) STRAIN: ATCC 28838 

488 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 

TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA GATTGAWACT 50 

5 CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC CGTGGTGAGA 150 

AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG TAGAGAAACT 2 00 

TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGCCC 2 50 

AATCAAGTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT CCATCCTTTG 300 

10 TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACCGGTAT CAAGGTTGTC 3 50 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT AACAACATTG 450 

CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA CCGGTGTCGG TGAAAGAACC 5 00 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG GTGTCATCAA 550 

15 CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA ATGAACGAAC 600 

CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC TATCGC TGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA TCGACAACAT 7 00 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA TATGGGTTTG 800 

2 0 TTGCAAGAAA G AATTAC C AC TACCAAGAAG GGTTC CGTTA CCTCCGTCCA 850 

AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT CCAGCTACYA 900 

CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG TATCTCCGAA 9 50 

TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTCCAAGT CTAGATTGTT 1000 

GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT ACTCAAGTTC 1050 

25 AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

GGTATGGATG AATTGTC TGA ACAAGACAAG TTGACTGTCG AAAGAGCCAG 115 0 

AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC GAAGTTTTCA 1200 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CGCTTCCTTC 1250 

AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 12 83 

30 



2 ) INFORMATION FOR SEQ ID NO : 468 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 
45 (B) STRAIN: ATCC 34135 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 

CTTCGAACAA GGCC AATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 50 

50 ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 100 

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 15 0 

AACCGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 2 00 

CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 250 

CCAGTTGACT GTAAGGAAAG AAAGCCAATT CACGCTGATC CTCCAGCTTT 3 00 

55 CGTTGAACAA TCC AC TGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 3 50 

TCGATTTATT AGCACCTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 400 

GGTGC TGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 450 

YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500 

C C AG AGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550 

60 AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 600 
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10 



15 



ACCACCAGGA 
AATATTTCAG 
ATTTTCAGAT 
AATTCCATCT 
TTTTACAAGA 
CAAGCTGTTT 
TACTTTCGCC 
AATTAGGTAT 
TTAGATGTTG 
CCAAGAAACT 
TGGGTATGGA 
AG AAAGATC C 
CACTGGTATT 
TCAGGGATGT 



GC TAG AGCAA 
AGATGAAGAA 
TCACCCAAGC 
GCTGTCGGTT 
AAGAATTACC 
ATGTCCCAGC 
CACTTGGATG 
CTACCCAGCT 
CAGTTGTTGG 
TTACAAGCTT 
TGAATTATCT 
AAAGATTCTT 
CCAGGTAAGT 
TCTTGCAGGT 



GAGTTGCTTT 
GGTCAAGATG 
AGGTTCTGAA 
ATCAACCAAC 
ACCACCAAGA 
CGATGATTTA 
CAACCACTGT 
GTCGATCCAT 
TCAAGAACAT 
ACAAGTCTTT 
GAACAAGATA 
ATCTCAACCA 
TAGTCAGATT 
AAGTACGATC 



AACTGGTTTG 
TCTTGTTATT 
GTCTCTGCAT 
TTTAGCAACC 
AGGGTTCCGT 
ACCGATCCTG 
CTTGTCYAGA 
TAGATTCTAA 
TATGAAGTTG 
ACAAGATATT 
AGTTAACYGT 
TTCTCTGTTG 
AGAAGAAACC 
ACTTACC 



ACCATTGCAG 650 

CATTGATAAC 7 00 

TATTAGGTAG 7 50 

GATATGGGTC 800 

TACTTCTGTC 850 

CTCCAGCTAC 900 

GGTATTTCCG 950 

GTC TAG ATT A 1000 

C AACTCAAGT 1050 

ATTGCTATTT 1100 

TGAAAGAGCA 1150 

CAGAAGTTTT 12 00 

ATCAAGTCTT 12 50 
1287 



20 



25 



30 



35 



40 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 469 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida Iambi ca 

(B) STRAIN: ATCC 24750 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 469 



TCGAACAAGG 
GATGGTCAAA 
CGTCAGAACC 
CTGTTAACGA 
CTTGGTAGAA 
TGTCGAMTGT 
TTGACCAATC 
GATTTATTAG 
TGCTGGTGTC 
CTAAGGCCCA 
AGAGAAGGTA 
CCTTGAAGGT 
C AC C AGGAGC 
TACTTCAGAG 
TTTCAGATTC 
TTCCWTCTGC 
TTACAAGAAA 
AGCTGTTTAC 
CTTTCGCCCA 
TTAGGTATTT 
GGATGTCGCT 
AAGAAACTTT 
GGTATGGACG 
AAAGATCCAA 
CTGGTATCCC 
AGAGATGTCC 



TCAATTACCA 
AGTTAGTTTT 
ATTGCTATGG 
C AC TGGTGC T 
TCTTGAACGT 
AAGGAGAGAA 
CACTGAAGCT 
CACCTTACGC 
GGTAAGACCG 
CGGTGGTTTC 
ACGATTTATA 
GAATCTAAGG 
AAGAGCTAGA 
ATGAAGAAGG 
ACCCAAGCAG 
CGTTGGTTAC 
GAATTACCAC 
GTCCCAGCTG 
CTTGGATGCT 
ACCCAGCTGT 
GTTGTCGGTC 
ACAAGCTTAC 
AATTATCTGA 
AGATTCCTTT 
AGGTAAGCTT 
TCGATGGTAA 



C C AATTCTT A 
GGAAGTTGCT 
ACGGTACCGA 
CCAATCTCTG 
CACTGGTGAC 
GACCAATTCA 
GAAGTTTTGG 
TAGAGGTGGT 
TTTTCATCCA 
TCCGTTTTCA 
CAGAGAAATG 
TCGCCCTCGT 
GTTGCTCTTA 
TCAAGATGTC 
GTTCCGAAGT 
CAACCAACCT 
CACCAAGAAG 
ATGATTTRAC 
ACCACCGTCT 
CGATCCATTA 
AAGAACATTA 
AAGTCC TT AC 
ACAAGATAAG 
CCCAACCTTT 
GTCAGATTAG 
GTACGACCAC 



ACGCTTTGGT 

CAACATTTGG 

AGGTTTAGTT 

TCCCAGTTGG 

CCAGTYGATG 

CCAAGACCCA 

AAACCGGTAT 

AAGATTGGTT 

AGAATTGATT 

CTGGTGTCGG 

AAGGAAACCG 

TTTCGGTCAA 

CTGGTTTGAC 

TTGCTTTTCA 

GTCTGCTTTG 

TAGCAACCGA 

GGTTCCGTCA 

TGACCCTGCA 

TGTC TAG AGG 

GATTCTAAGT 

TACCGTTGCA 

AAGATATCAT 
CTTACTGTTG 

CTCCGTCGCA 
AAGAAACCAT 
TTA 



CATGGAAAAC 50 

GTGAAAACAC 100 

AGAGGTCAAC 150 

TAGAGGTACT 200 

AAAGAGGTCC 250 

CCTGCTTTCG 3 00 

TAAGGTTGTC 350 

TGTTCGGTGG 400 

AACAATGTTG 450 

TGAAAGAACC 500 

GTGTTATTAA 55 0 

ATGAACGAAC 600 

CATTGCTGAA 650 

TTGACAACAT 700 

TTAGGTAGAA 750 

TATGGGTTTG 800 

CCTCCGTCCA 85 0 

CCAGCAACCA 900 

TATTTCCGAA 950 

C TAG ATTATT 1000 

AC YC AAGTCC 1050 

TGCTATTTTG 1100 

AAAGAGCAAG 1150 

GAAGTTTTC A 1200 

TATTTCTTTC 12 50 
1283 
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2 ) INFORMATION FOR SEQ ID NO: 47 0 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1140 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 
15 (B) STRAIN: ATCC 6603 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470 

AGGTGCCTCT GTCACTGACA CTGGTTCTCC AATCTCTGTC CCTGTTGGTC 50 

20 GTGAAAC CTT GGGTAGAATT ATCAACGTTG TTGGTGAGCC AATTGACGAG 100 

AGAGGCCCAA TCAACTCCAA GAAGAGAAAC CCAATTCACA CTGAGCCACC 150 

ATCGTTTGTT GAAC AATCC A CTTCTGCTGA AGTTTTGGAG ACTGGTATCA 200 

AGGTTGTCGA CTTGTTGGCC CCTTACGCCA GAGGTGGTAA GATTGGTTTG 250 

TTCGGTGGTG CCGGTGTCGG TAAGACCGTT TTCATCCAAG AGTTGATTAA 3 00 

25 CAACATTGCC AAGGCCCACG GTGGTTTCTC TGTTTTCACT GGTGTCGGTG 3 50 

AAAGAACCAG AGAAGGTAAC GATTTGTACC GTGAAATGCA AGAGACCGGT 400 

GTCATCAACT TCGAGGGTGA CTCCAAGGTC GCCTTGGTCT TCGGTCAAAT 450 

GAACGAACCA CCAGGAGCCA GAGCTAGAGT TGCTTTGACC GGTTTGACTA 500 

TTGCCGAGTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 55 0 

3 0 GACAACATTT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 600 

GGGTCGTATT CCATCCGCTG TCGGTTACCA ACCAACCTTG GCCACCGATA 65 0 

TGGGTGCTTT GCAAGAGAGA ATTACCACCA CCAAGAAGGG TTCCGTCACC 7 00 

TCTGTCCAAG CCGTTTATGT TCCAGCTGAT GACTTGACTG ACCCTGCTCC 750 

AGCCACCACC TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 800 

35 TCTCTGAATT GGGTATCTAC CCAGCTGTCG ACCCATTGGA CTCCAAGTCT 850 

AGATTGTTGG ACGCTTCTAT TGTTGGTAAG GAGCACTACG AAGTTGCTTC 900 

TAACGTTCAA C AAAC TTTGC AAGCTTACAA GTCTTTGCAA GATATCATTG 950 

CCATTTTGGG TATGGATGAA TTGTCTGAGG CTGACAAGTT GACCGTTGAG 1000 

AGAGCCAGAA AGATCCAAAG ATTCTTGTCT CAACCATTCG CTGTTGCCGA 1050 

40 GGTTTTCACT GGTATCCCAG GTAGATTGGT CAGATTGGAG G AC AC TGTC A 1100 

GATCCTTCAA GGAAGTTTTG GACGGTAAGT ACGACCACTT 114 0 



45 2 ) INFORMATION FOR SEQ ID NO: 471 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1296 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471 
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TCAATTCGAA CAAGGTCAAT TACCACCAAT TTTAAACGCT TTAGTCATGG 50 

ACAACGGTGG AACTAAGTTA GTTTTAGAAG TTGCTCAACA TTTAGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TCAACCAGTT AATGATACTG GTGCTCCAAT CTCTGTCCCA GTCGGTAGAG 2 00 

5 GTACTTTAGG TAGAATCTTA AAC G TT ATT G GTGATCCAGT CGATGAAAGA 2 50 

GGTCCAATCG ATTGTAAGGA AAGAAAACCA ATTCATCAAG ATCCTCCTGC 300 

TTTCGTTGAA CAATCAACTG AAGCTGAAGT TTTAGAAACT GGTATTAAAG 3 50 

TTGTCGATTT ATTAGCTCCT TACGCTAGAG GTGGTAAGAT TGGTTTATTC 400 

GGTGGTGCAG GTGTTGGTAA AACCGTTTTT ATTCAAGAAT TAATTAACAA 450 

10 TGTTGCAAAA GCTCATGGTG GTTTCTC CGT TTTCACTGGT GTCGGTGAAA 5 00 

GAACTAGAGA AGGTAACGAT TTATACAGAG AAATGAAGGA AACTGGTGTT 550 

ATTAATTTAG AAGGTGATTC TAAGGTCGCA TTAGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGAGCAAGAG CAAGAGTTGC TTTAACTGGT TTAACTATTG 650 

CTGAATATTT CAGAGATGAA GAAGGTCAAG ATGTC TTGTT ATTCATTGAT 7 00 

15 AACATTTTCA GATTTACTCA AGCAGGTTCT GAAGTTTCCG CTTTGTTAGG 750 
TAGAATTCCA TCCGCTGTCG GTTATCAACC AACTTTAGCA ACTGATATGG 800 
GTCTTTTACA AGAAAGAATT ACTACTACTA AGAAAGGTTC TGTTACTTCC 850 
GTTCAAGCAG TTTATGTCCC AGCAGATGAT TTAACTGATC CTGCTCCAGC 900 
AACTACTTTC GCCCACTTAG ATGCAACTAC TGTCTTATCT AGAGGTATTT 950 

20 CCGAATTAGG TATTT AC C C A GCTGTCGATC CATTAGATTC TAAATCTAGA 1000 

TTATTAGATG TTGC AGTTGT TGGTCAAGAA CATTATGATG TTGCAACTCA 1050 

AGTTCAKGKA ACTTTACAAG CTTACAAATC TTTACAAGAT AxTATrGC-rA lluu 

TTTTAGGTAT GGATGAATTA TCTGAACAAG ATAAATTAAC CGTTGAAAGA 1150 

GCTAGAAAAA TTCAAAGATT CTTATCTCAA CCTTTCTCTG TCGCAGAAGT 1200 

2 5 TTTCACTGGT ATCCCAGGTA AGCTTGTTAG ATTAGAAGAA ACTATTTCTT 1250 

CATTCAGAGA TGTCTTAGCA GGTAAGTACG ATCACTTACC AGAAAA 129 6 



3 0 2) INFORMATION FOR SEQ ID NO: 472 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 

45 



ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50 

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TACTTCTGTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 200 

50 GT AC TTTGGG TAGAATCATC AATGTTACTG GTGACCCAAT TGATGAAAGA 2 50 

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 3 00 

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 350 

TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGTAA AACCGTGTTT ATCCAAGAAT TGATCAACAA 450 

55 CATTGCCAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500 

G AAC C AG AG A AGGTAACGAT TTGT AC CGTG AAATGAAGGA AACTGGTGTT 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CTGAAT AC TT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 7 00 

60 AACATTTTCA GATTCACCCA AGCTGGTTCA GAAGTGTCTG CTTTGTTGGG 750 
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TCGTATTC C A TCCGCTGTCG GTTATCAACC 
GTCTTTTGCA AGAACGTATT ACCACTACCA 
GTCCAAGCCG TTTACGTGCC AGCTGATGAT 
T AC C AC TTTC GCTCACTTGG ATGCCACCAC 
5 CGGAGTTGGG TATTTACCCA GCTGTCGATC 
TTGTTGGATG CTGCCGTTGT TGGTCAAGAA 
TGTTCAACAA ACTTTACAAG CTTACAAGTC 
TCTTGGGTAT GGATGAATTG TCCGAACAAG 
GCTCGTAAGA TTCAAAGATT CTTGTCTCAA 
10 TTTCACTGGT ATTCCAGGTA AATTGGTTAG 
CATTCAAGGA AGTCTTGGAA GGTAAGTACG 



AACCTTGGCC ACCGATATGG 800 

AAAAGGGTTC AGTTACTTCA 850 

TTAACCGATC CTGCTCCAGC 900 

TGTGTTGTCT AGAGGTATTT 950 

CATTGGATTC CAAATCCAGA 1000 

CATTACGACG TTGCCACTGG 1050 

CTTGCAAGAT ATCATTGCTA 1100 

ATAAATTGAC TGTCGAAAGA 1150 

CCATTTGCTG TCGCTGAAGT 1200 

ATTGTCTGAA ACTGTCAAGT 12 50 

ATAACTTGCC AGAAAAT 1297 



15 2 ) INFORMATION FOR SEQ ID NO: 473 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1285 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 



CCAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 50 

GTGAAGACGG TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 100 

AACACCGTCC GTACCATTGC TATGGACGGT ACCGAGGGTT TGGTGCGTGG 150 

CAC TGGTGTC CACGACACCG GACACCCCAT CATGACTCCC GTCGGTGACG 200 

35 GTACC CTGGG ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 250 

GGTCCCGTCA AGACTGACAA GTTCCGCCCC ATCCACGCCG AGGCCCCTGC 3 00 

CTTCGATGAG CAGGCTACCA GTGCCGAGGT TCTTGAGACC GGTATCAAGG 350 

TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGCAA GACCGTCTTC ATC CAGGAGC TGATTAACAA 450 

40 CATCGCCAAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 500 

GTACTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 

ATCAACCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTTCG GTCAGATGAA 600 

CGAGCCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 650 

CTGAGTATTT CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC 700 

45 AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 75 0 

TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 800 

GTGCCCTTCA GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 850 

GTCCAGGCCG TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 900 

CACCACCTTC GCCCATTTGG ATGCCACCAC TGTC TTGTCT CGTGCCATCT 950 

5 0 CTGAGTTGGG TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 1000 

CTTCTTGACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 1050 

CGTTCAGCAG AC TTTGC AGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

TTCTTGGTAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 115 0 

GCTCGTAAGA TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 12 00 

55 CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 1250 

• CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 12 85 



60 2 ) INFORMATION FOR SEQ ID NO: 474 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1283 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 



TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGC TTTGG A AATCGACACT 50 
CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100 
TGTCAGAACC ATTGC TATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 150 
ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCCCAGTTGG T AG AGAAAC C 200 

2 0 TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 250 
AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 3 00 

TTGAACAATC C AC TG C.TGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 350 
G AC TTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 400 
TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 45 0 

25 CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 500 
AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 550 
CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600 
CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 650 
TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 7 00 

30 TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 
TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 800 
TTGCAAGAAA GAATTACTAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 850 
AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 900 
CTTTCGCGCA TTTGGATGCC AC CAC TGTGT TGTCCAGAGG TATCTC TGAA 950 

35 TTGGGTATCT ACCCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 1000 

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAG AGC TAG 1150 

AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 12 00 

40 CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 1250 

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 1283 



45 2 ) INFORMATION FOR SEQ ID NO : 475 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1290 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13 803 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 475 
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CGACGAAGGT AACTTGCCAG CTATTTTGAA TGCTTTGACT TTGAAGAACG 50 

GTGACCAAGA C TTGGTTTTG GAAGTTGCTC AACATTTGGG TGAAAACACC 100 

GTCAGGGCTA TTGCTATGGA TGGTACCGAA GGTTTAGTCA GAGGTACTGC 150 

TGTCACTGAT ACCGGTGCTC CAATTTCTGT CCCAGTTGGT AG AGGTAC CT 2 00 

5 TGGGTAGAAT TATCAACGTT GTTGGTGAAC CAATTGATGA CAGAGGTCCA 2 50 

ATTGAATGTA AGGAAAGAAA GCCAATTCAC GCTGAACCAC CTTCATTCGT 3 00 

TGAACAATCT ACTGCTGCCG AAATTTTGGA AACCGGTATT AAGGTTGTCG 3 50 

ATTTGTTGGC TCCTTACGCT AGAGGTGGTA AGATTGGTTT GTTCGGTGGT 400 

GCTGGTGTCG GTAAAACCGT CTTCATCCAA GAATTGATTA ACAACATTGC 4 50 

10 CAAAGCTCAT GGTGGTTTCT CTGTCTTTAC CGGTGTCGGT GAAAGAACCA 500 

GAGAAGGTAA CGATTTGTAC CGTGAAATGA AAGAAACTGG TGTCATCAAC 550 

TTGGAAGGTG ACTCCAAGGT CGCCTTGGTC TTCGGTCAAA TGAACGAACC 600 

ACCTGGTGCT AGAGCTAGAG TTGCTTTGAC TGGTTTGACC ATTGCTGAAT 650 

ACTTCAGAGA TGAAGAAGGT CAAGATGTCT TGTTGTTCAT TGACAACATT 7 00 

15 TTCAGATTCA CCCAAGCTGG TTCCGAAGTG TCTGCTTTGT TGGGTCGTAT 7 50 

TCCATCTGCT GTCGGTTATC AACCAACCTT GGCCACTGAT ATGGGTCTTT 800 

TGCAAGAACG T ATTACC AC C ACCAAGAAAG GTTCTGTTAC TTCTGTCCAA 850 

GCTGTTTATG TCCCAGCCGA TGATTTAACC GATCCAGCTC CAGCTACCAC 900 

CTTCGCTCAC TTGGATGCCA CTACTGTCTT GTCTAGAGGT ATTTCTGAAT 95 0 

20 TGGGTATTTA CCCAGCTGTC GATCCATTGG ATTCTAAATC CAGATTATTG 1000 

GATGCTACTG TTGTTGGTCA AGAACATTAT GATGTTGCCA CTGGTGTTCA 1050 

ACAAACTTTA CAAGCTTACA AGTCCTTGCA AGATATCATT GCTATTTTGG 1100 

GTATGGATGA ATTGTCCGAA GCCGATAAAT TGACTGTCGA AAGAGCTCGT 1150 

AAG ATTC AAA GATTCTTGTC CCAACCATTC GCTGTTGCCG AAGTTTTCAC 1200 

25 TGGTATCCCA GGTAGATTGG TTAGATTACA AG AT AC TGTT AAATCTTTCA 12 50 

AAGATGTCTT GGAAGGTAAA TACGATCACT TGCCAGAAAA 12 90 



3 0 2) INFORMATIO N F OR SEQ ID NO: 4 7 6 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 67 bases 

(B) TYPE: Nucleic acid 

3 5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 



TTTGAACGCC CTTGAGATCA AGAACCACGA GTCTGGAAGA TTGGTC CTCG 50 

AGGTTGCCCA GCACTTGGGT GAGAACACTG TCAGAACTAT TGCCATGGAT 100 

GGT AC CGAAG GTCTCGTCCG TGGTGAGTCT GTTGTTGACA CCGGTTCTCC 150 

AATTAC TGTC CCAGTTGGTC GTGAG AC CTT GGGTCGTATC ATCAACGTTA 200 

50 TTGGTGAGCC AATTGATGAG CGTGGTCCAA TCAACACCAA GCACAGAAAC 250 

CCAATTCACG CTGAGCCACC ATCCTTCGCT GAGCAGTCTA CTGCTGCTGA 3 00 

GGTTTTGGAG ACTGGTATCA AGGTTGTCGA CCTTCTCGCC CCATACGCCA 3 50 

GAGGTGGTAA GATTGGTCTC TTTGGTGGTG CCGGTGTCGG TAAGACCGTT 400 

TTCATCCAGG AGTTGATTAA CAACATTGCC AAGGCCCACG GTGGTTTCTC 450 

55 TGTTTTCACC GGTGTCGGTG AGAGAACCAG AGAGGGTAAC GATTTGTACC 500 

GTGAGATGAA GGAGACTGGT GTCATCAATC TCGAGGGTGA GTCCAAGGTT 550 

GCTCTCGTCT TCGGTCAGAT GAACGAGCCA CCAGGTGCCC GTGCCCGTGT 600 

CGCTTTGACC GGTTTGACCA TTGCTGAGTA CTTCAGAGAT GAGGAGGGTC 650 

AGGATGTCTT GTTGTTTATT GACAACATTT TCAGATTCAC CCAGGCCGGT 700 

60 TCCGAGGTGT CTGCCTTGCT TGGTCGTATT CCATCCGCTG TCGGTTACCA 750 
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GCCAACTTTG GCCACCGATA 
CCCAGAAGGG TTCCGTCACT 
GATTTGACTG ATCCTGCTCC 
CACTGTGTTG TCCCGTGGTA 
5 AC C C ATTGG A CTCCAAGTCC 
GAGCACTACA ACACCGCCAC 
GTCTCTCCAG GATATCATTG 
CTGACAAGTT GACTGTCGAG 
CAGCCATTCG CTGTCGCTGA 
10 TAGACTTCAG G AC AC CATC A 
ACGACC AC TT GCCAGAG 



TGGGTTTGTT GCAGGAGAGA 
TCTGTCCAGG CCGTTTACGT 
AGCCACCACT TTCGCCCACT 
TCTCTGAGTT GGGTATTTAC 
AGATTGTTGG ACGCTGCCGT 
CGATGTCCAG CAGACCCTTC 
CTATTTTGGG TATGGATGAG 
AGAGCCAGAA AGATTCAGCG 
GGTTTTCACC GGTATCCCAG 
AGTCCTTCAG AGAGGTTTTG 



ATTACCACCA 800 

CCCAGCTGAT 85 0 

TGGACGCCAC 900 

CCAGCTGTCG 95 0 

TGTTGGTGAC 1000 

AGGCTTACAA 1050 

TTGTCTGAGG 1100 

TTTCCTTTCC 1150 

GTAGATTGGT 12 00 

GACGGTAAGT 12 5 0 
1267 



15 2) INFORMATION FOR SEQ ID NO: 477 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1296 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 2 8269 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 



CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 50 

AGAACGGTGA "CCAAGACTTG GTCTTGGAAG ~TTGCCCAACA CTTGGGTGAA 100 

AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 15 0 

TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

35 GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 25 0 

GGTCCAATTG AATGTAAGGA AAGAAAGCCA ATTCACGCTG AACCACCTTC 300 

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 350 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 450 

40 CATTGCCAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CCGAATAC TT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

45 AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 900 

CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

50 CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 1000 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA AC CTTGC AAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TC C AAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 1200 

55 TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 1296 



60 2) INFORMATION FOR SEQ ID NO: 478 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1295 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 478 



TC AATTC GAG CAAGGCAACC TCCCTGCCAT CTTGAACGCT CTCACCTTGA 50 

AGAATGGTGA CAACGACTTG GTTTTGGAAG TTGCC C AGCA CTTGGGTGAG 100 

AACACCGTCA GAGCCATTGC CATGGATGGT ACCGAGGGTT TGGTTAGAGG 150 

TGCGTCCGTC AAGGACACTG GCGCCCCTAT CTCGGTCCCC GTTGGCCGCG 2 00 

2 0 GGACTTTGGG TCGTATCATC AACGTCACCG GTGACCCCAT TGACGAGAGA 2 50 

GGTCCCATCG AGCTGACCCA GAGAAACCCT ATCCACGCCG ACCCCCCCTC 3 00 

GTTCGTTGAG CAGTCCACCA ACGCTGAGGT TTTGGAGACT GGTATCAAGG 3 50 

TTGTCGATTT GTTGGCTCCC TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 400 

GGTGGTGCCG GTGTCGGTAA GACCGTCTTC ATTCAGGAGT TGATCAACAA 450 

2 5 CATCGCCAAG GCCCACGGTG GGTTCTCGGT CTTCACTGGT GTCGGTGAGA 500 

GAACTAGAGA GGGTAACGAC TTGTACCGTG AGATGAAGGA GACCGGTGTC 550 

ATCAACTTGG AGGGTGAC TC CAAGGTGGCC TTGGTGTTCG GTCAGATGAA 600 

CGAGCCCCCT GGAGCCAGAG CCAGAGTCGC CTTG AC CGGG TTGACCATTG 650 

CCGAATACTT TAGAGACGAG GAGGGTCAGG ATGTGTTGTT GTTC GTCG AC 7 00 

3 0 AACATCTTCA GATTCACCCA AGCTGGTTCG GAGGTGTCGG CCTTGTTGGG 750 

TCGTATTCCC TCTGCCGTCG GTTACCAGCC CACCTTGGCA ACTGATATGG 800 

GATTGTTGCA GGAGCGTATC ACCACGACCA AGAAGGGTTC CGTCACCTCA 850 

GTGCAGGCCG TCTACGTCCC CGC TGATGAC TTGACTGACC CTGCTCCCGC 900 

CACCACCTTT GCCCACTTGG ACGCCACCAC CGTGTTGTCC AGAGGTATCT 950 

3 5 CTGAGTTGGG TATCTACCCC GCCGTCGACC C CTTGG ACTC CAAATCGAGA 1000 

TTGTTGGACG CTGCCGTGGT CGGTCAGGAG CACTACGATG TTGCCTCGAA 1050 

CGTCCAGCAG ACCTTGCAGG CCTACAAGTC CTTGCAGGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAGTTG TCCGAGGCTG ACAAGTTGAC CGTTGAGAGA 1150 

GCCAGAAAGA TCCAGAGATT CTTGTCGCAG CCCTTCGCTG TTGCCGAGGT 12 00 

40 TTTCACTGGT ATCAAGGGTA GATTGGTCAG ATTGGAGGAC ACCGTCAGAT 12 50 

CTTTCAAGGA GGTTTTGGAG GGTAAGTACG ACCACTTGCC CGAGA 1295 



45 2 ) INFORMATION FOR SEQ ID NO: 479 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479 
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